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Tbe Golden Guess 
Is Morning-Star to tbe full round of Truth. 

-* Tennyson. 



Preface 

This text represents an extensive expansion and revision, both in scope and 
content, of the earlier work entitled “Military Cryptanalysis, Part I" by William 
F. Friedman. This expansion and revision was necessitated by the considerable 
advancement made in the art since the publication of the previous text. 

I wish to express grateful acknowledgment for Mr. Friedman’s generous as- 
sistance and invaluable collaboration in the preparation of this edition. I also 
extend particular appreciation to my colleague Robert E. Cefail for his numerous 
valuable comments and assistance in writing tbe new material which is contain- 
ed herein. 



- L. D. C. 
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r . _ .SECTION I 
'‘Bfi^ODUCTORY ' REMARKS ‘ 

- .* . . ' J- • 

— .. .. . Paragraph 



Scope of this text . 1 

Mental equipment necessary for crypt analytic work 2 

Validity of results of cryptanalys is ........ . 3 



1. Scope of this tex t. — a This text constitutes the first of a 

series of six basic texts on the art of cryptanalysis . Although most of 
the information contained herein is applicable to cryptograms of various 
types and sources, special emphasis will be laid upon the principles and 
methods of solving military^ cryptograms. Except for an introductory 
discussion of fundamental principles underlying the science of crypt ana- 
lytics , this first text in the series will deal solely with the principles 
and methods for the analysis of monoalphabetic substitution ciphers . 

Even with this limitation it will be possible to discuss only a few of the 
many variations of this one type that are met in practice; but with a 
firm grasp upon the general principles few difficulties should be expe- 
rienced with any modifications or variations that may be encountered. 



b. This and some of the succeeding texts will deal only with basic 
types of cryptosystems not because they may be encountered unmodified in 
military operations but because their study is essential to an under- 
standing of the principles underlying the solution of the modern, very 
much more complex types of codes, ciphers, and certain encrypted trans- 
mission systems that are likely to be employed by the larger governments 
of today in the conduct of their military affairs in time of war. 



c . It is presupposed that the student has no prior background in 
the field of cryptology; therefore cryptography is presented concurrently 
with cryptanalysis . Basic terminology and preliminary cryptologic consi- 
derations are treated in Section II; other terms are usually defined upon 
their first occurrence , or they may be found in the Glossary (Appendix l) . 

d. The cryptograms presented in the examples embrace messages from 
hypothetical air, ground, and naval traffic; thus, the student will have 
the opportunity to familiarize himself with the language and phraseology 
of all three Services comprising the Armed Forces of the United States. 



1 Each text has its accompanying course in cryptanalysis, so that the 
student may test his learning and develop his skill in the solution of 
the types of cryptograms treated in the respective texts. The problems 
which pertain to this text, constitute Appendix 13. 

2 The word "military” is here used in its broadest sense. In this 
connection see subpar. d, below. 
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2. Mental equipment necessary for cryptanalytic work , —a . Captain 
Parker Hitt, in the first United Stakes Army manual^ dealing with cryp- 
tology, opens the first chapter of hi3 valuable treatise with the 
following sentence: 

"Success in dealing with unknown ciphers is measured by these four 
things in the order named: perseverance, careful methods of analysis, 

intuition, luck." 

These words arc as true today as they were then. There is no royal 
road to success in the solution of cryptograms. Hitt goes on to say: 

"Cipher work wi3 1 have little permanent attraction for one who 
expects results at once, without labor, for there is a vast aaount of 
purely routine labor in the preparation of frequency tables, the rearrange- 
ment of ciphers for examination, and the trial and fitting of letter to 
letter before the message begins to appear." 

The present author deems it advisable to add that the kind of work 
involved in solving cryptograms is not at all similar to that involved in 
solving crossword puzzles, for example. The wide vogue the latter have 
had and continue to have is due to the appeal they make to the quite 
common interest in mysteries of one sort or another; but in solving a 
crossword puzzle there is usually no necessity for performing any preli- 
minary labor, and palpable results become evident after the first minute 
or two of attention. This successful start spurs the crossword "addict" 
on to complete the solution, which rarely requires more than an hour's 
time. Furthermore, crossword puzzles are all alike in basic principles 
and once understood, there is no more to learn. Skill comes largely from 
the embellishment of one's vocabulary, though, to be sure, constant prac- 
tice and exercise of the imagination contribute to the ease and rapidity 
with vhich solutions are generally reached. In solving cryptograms, 
however, many principles must be learned, for there are many different 
systems of varying degrees of complexity. Even some of the simpler 
varieties require the preparation of tabulations of one sort or another, 
which many people find irksome; moreover, it is only toward the very close 
of the solution that results in the form of intelligible text become 
evident. Often, indeed, the student will not even know whether he is on 
the right ".rad- until he has performed a large amount of preliminary 
"spade work" involving many hours of labor. Thus, without at least a 
willingness to pursue a fair amount of theoretical study, and a more than 
average amount of patience and perseverance , little skill and experience 
can be gained in the rather difficult art of cryptanalysis . General 
Givierge, the author of an excellent treatise on cryptanalysis, remarks 
in this connection:^- 

"The cryptanalyst's attitude must be that of William the Silent: 

No need to hope in order to undertake, nor to succeed in order to 
persevere ." 



3 Hitt, Capt. Parker, Manual for the Solution of Military Ciphers . 

Army Service Schools Press. Fort Leavenworth, Kansas, 1916. 2d Edition,' 
1918. (Both out of print.) 

^ Givici-ge, General Marcel, Cours de Cryptographie , Paris, 1925, p. 301* 
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b. As regards Hitt’s reference to careful methods of analysis, 
before one can be said to be a cryptanalyst worthy of the name it is 
necessary that one should have, firstly, a sound knowledge of the basic 
principles of cryptanalysis, and secondly, a long, varied, and active 
practical experience in the successful application of those principles. 

It is not sufficient to have read treatises on this subject. One month's 
actual practice in solution is worth a whole year's mere reading of 
theoretical principles. An exceedingly important element of success in 
solving the more intricate cryptosystems is the possession of the rather 
unusual mental faculty designated in general terms as the power of in- 
ductive and deductive reasoning . Probably this is an inherited father 
than an acquired faculty; the best sort of training for its emergence, 

if latent In the individual, and for its development is the study of 
the natural sciences, such as chemistry, physics, biology, geology, and 
the like. Other sciences such as linguistics, archaeology, and philology 
are also excellent. 

c. Aptitude in mathematics is quite important, more especially in 
the solution of ciphers and enciphered codes than in codebook reconstruc- 
tion, which latter is purely and sinrply a linguistic problem. Although 
in the early days of the emergence of the science of cryptanalytics little 
thought was given to the applications of mathematics in this field, many 
branches of mathematics and, in particular, probability and statistics, 
have now found cryptologic applications. Those portions of mathematics 
and those mathematical methods which have cryptologic applications 5 are 
known collectively as cryptomathematics . 



5 It is quite important to stress at this point that in professional 
cryptologic work the science of ‘cryptanalytics is subordinated to the 
art of cryptanalysis, just as in ,the world of music the technical virtu- 
osity of a great violinist is adjuvant to the expression of music, that 
is, the virtuosity is a "tool" for the recovery of the complete musical 
"plain text" conceived by the composer. Since the practice of cryptana- 
lysis is an art, mathematical approaches cannot always be expected to 
yield a solution in cryptology, because art can and must transcend the 
cold logic of scientific metiod. By way of example, an experienced 
Indian guide can usually find his way out of a dense forest more readily 
than a surveyor equipped with all the refined apparatus and techniques 
of his profession. Likewise, an experienced cryptanalyst can generally 
find his way through a cryptosystem more readily than a pure mathematician 
equipped merely with the techniques of his field no matter how abstruse 
or refined they may be . A cryptomathematician of repute once stated 
that "the only effect of /refined mathematical techniques/ is frequently 
to discourage one so much that one does nothing at all and some unaathe- 
matical ignoramus then gets the problem out in some very unethical way. 
This is intensely irritating." See also in this connection the remarks 
made in subpar. 27 e in reference to the validity of statistical tests 
in cryptanalysis . " - - ' 
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d. An active imagination, or perhaps what Hitt and other writers 
call intuition , is essential, hut mere imagination uncontrolled hy a 
judicious spirit will he more often a hindrance than 'a help. In prac- 
tical cryptanalysis the imaginative or intuitive facilities must, in other 
words, he guided by good judgment, hy practical experience, and hy as 
thorough a knowledge of the general situation or extraneous circumstances 
that led to the sending of the cryptogram as is possible to obtain. In 
this respect the many crypbograms exchanged between correspondents whose 
identities and general affairs, commercial, social, or political, are 
known are far more readily solved® than are isolated cryptograms exchanged 
between unknown correspondents , dealing with unknown subjects. It is 
obvious tli n +. in the former case there are good data upon which the intu- 
itive powei t of th a cryptanalyst can be brought to bear, whereas in the 
latter cas ' no such ta are available , Consequently, in the absence of 
6”ch data, no matter hew good the imagination and intuition of the cryp- 
tanalyst, these powers are of no particular service to him. Some writers, 
however, regard the intuitive spirit as valuable from still another 
viewpoint, as may be noted in the following 

"Intuition, like a flash of lightning, lasts only for a second. 

It generally comes when one Is tormented by a difficult decipherment and 
when one reviews in his mind the fruitless experiments already tried. 
Suddenly the light breaks through and one finds after a few minutes what 
previous days of labor were unable to reveal." 

This, too, is true, but unfortunately there is no way In which the 
intuition may be summoned at will, when it is most needed.® There are 
certain authors who regard as indispensable the possession of a somewhat 



6 The application in practical, operational cryptanalysis of "probable 
words" or "cribs", i.e., plain text assumed or known to be present in a 
cryptogram, is developed in time of war into a refinement the extent and 
usefulness of which cannot be appreciated by the uninitiated. Even as 
great a thinker as Voltaire found the subject of cryptanalysis stretching 
his credulity. to the point that he said: 

"Those who boast that they can decipher a letter without knowing its 
subject matter, and without preliminary aid, are greater charlatans than 
those who would boast of understanding a language which they have never 
learned ." — Dlctionnaire Philosophique , under the article "Poste". 

7 Lange et Soudart, Traite de Cryptographie , Libralre Felix Alcan, 
Paris, 1925> p. 1CA-. 

8 The following extracts are of interest in this connection: 

“The fact that the scientific investigator works 5D per cent of his time by non-rafcional means is, it seems, quite 
insufficiently recognised. There is without the least doubt an instinct for research, and often the most successful 
invcsllgntois of nature are quite unaMc to give an account of their reasons for doing such and such an experi- 
ment, or for 1 ‘lacing side by sido two apparently unrelated facts Again, one Of the most salient traits ip the 
chaiaeter of the successful scientific worker is the capacity for knowing that a point is proved when it would not 
appeal to be proved to an outside intelligence functioning in a purely rational manner; thus the investigator 
fools that some pi oposition is true, and proceeds at once to the next set of experiments without waiting and wasting 
time in the elaboration of the fouual proof of the point winch heavier minds would need. Questionless such a 
scientific intuition may and does sometimes lead investigators astray, but it is quite certain that if they did 
not widely make use of it, they would not get a quarter as far as they do. Experiments confirm each other, and a 
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rare, rather mysterious faculty that they designate by the word "flair", 
or by the expression "cipher brains" . Even so excellent an authority as 
General Givierge,9 in referring to this mental faculty, uses the fol- 
lowing words: 

"Over and above perseverance and this aptitude of mind which some 
authors consider a special gift, and which they call intuition, or even, 
in its highest manifestation, clairvoyance, cryptographic studies will 
continue more and more to demand the qualities of orderliness and memory." 

Although the present author believes a special aptitude for the 
work is essential to cryptanalytic success, he is sure there is nothing 
mysterious about the matter at all. Special aptitude is prerequisite to 
success in all fields of endeavor. There are, for example, thousands 
of physicists, hundreds of excellent ones, but only a handful of world- 
wide fame. Should it be said, then, that a physicist who has achieved 
very notable success in his field has done so because he is the fortunate 
possessor of a mysterious facility? That he is fortunate in possessing a 
special aptitude for his subject is granted, but that there is anything 
mysterious about it, partaking of the nature of clairvoyance (if, indeed, 
the latter is a reality ) is not granted. While the ultimate nature of 
any mental process seems to be as complete a mystery today as it has 
ever been, the present author would like to see the superficial veil of 
mystery removed from a sub ject v that has been shrouded in mystery from 
even before the Middle Ages down to our own times . (The principal and 
readily understandable reason“for this is that governments have always 
closely guarded cryptographic secrets and anything so guarded soon becomes 
"mysterious".) He would, rather, have the student approach the subject 
as he might approach any other science that can stand on its own merits 
with other sciences, because iSSptanalytics , like other sciences, has a 
practical importance in human "rfairs . It presents to the inquiring mind 
an interest in its own right as a branch of knowledge; it, too, holds 
forth many difficulties and disappointments, and these are all the more 



false step is usually soon discovered. And nof only by this partial replacement of reason by intuition does the 
work of science go on, but also to the born scientific worker — and emphatically they cannot be made — the struc- 
ture of the method of research is as it were given^ he cannot explain jt to you, though he may be brought to agree 
a posteriori to a formal logical presentation of the way the method works”. — Excerpt from Needham, Joseph, 
The Sceptical Biologist, London, 1929, p. 79. 

“The essence of scientific method, quite' simply, is to try to see how data arrange themselves into causal 
configurations. Scientific problems are soh ed by collecting data and by "thinking about them all the time.” 
We need to look at strange things until, by the appearance of known configurations, they seem familiar, and to 
look at familiar things until we see novel configurations which make them appear strange. We must look at 
events until they become luminous. That la scientific method .. . Insight is the touchstone .. . The appli- 
cation of insight as the touchstone of method ggables us to evaluate properly the role of imagination in scientific 
method. The scientific process is akin to the artistic process: it is a process of selecting out those elements of 
experience which fit together and recombining them in the mind. Much of this kind of research is simply a cease- 
less mulling over, and even the physical scientist has considerable need of an armchair . . . Our view of scien- 
tific method as a struggle to obtain insight forces the admission that science is half art . . . Insight is the 
unknown quantity which has eluded students of scientific method". — Exoerpts from an article entitled Insight and 
Scientific Method, by Willard Waller, in The American Journal of Sociology, Vol. XL, 1934’ 

9 Op. eit ., p. 302. 
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keenly felt when the nature of these difficulties is not understood by 
those unfamiliar with the special circumstances that very often are the 
real factors that led to success in other cases. Finally, just as in 
the other sciences wherein men labor long and earnestly for the true 
satisfaction and pleasure that comes from work well done, so the mental 
pleasure that the successful cryptanalyBt derives from his accomplish- 
ments is very often the only reward for much of the drudgery that he must 
do in his daily work. General Givierge’s words in this connection are 
well worth quoting: 10 

"Some studies will last for years before bearing fruit. In the case 
of others, cryptanalysts undertaking them ne-ver get any result. But, for 
a cryptano] yst who likes the work, the joy of discoveries effaces the 
memory of his hours of doubt and impatience." 

e. With his usual deft touch, Hitt says of the element of luck, as 
regards the role it plays in analysis: 

"As to luck, there is the old miners' proverb: ’Gold is where you 

find it.’" 

The cryptanalyst is lucky when one of the correspondents whose cryp- 
tograms he is studying makes a blunder that gives the necessary clue; or 
when he finds two cryptograms identical in text but in different keys in 
the same system; or when he finds two cryptograms identical in text but 
in different systems, and so on. The element of luck is there, to be 
sure, but the cryptanalyst must be on the alert if he is to profit by 
these lucky "breaks" . 

f . If the present author were asked to state, in view of the pro- 
gress""in the field since 1916 , what elements might be added to the four 
ingredients Hitt thought essential to cryptanalytic success, he would 
be inclined to mention the following: 

(1) A broad, general education, embodying interests covering as 
many fields of practical knowledge as possible. This is useful because 
the cryptanalyst is often called upon to solve messages dealing with 
the most varied of human activities, and the more he knows about these 
activities, the easier his task. 

(2) Access' to a large library of current literature, and wide and 
direct contacts with sources of collateral information. These often 
afford clues as- to the contents of specific messages. For example,, to 
be able instantly to have at his disposal a newspaper report or a 
personal report of events described or referred to in a message under 
investigation goes a long way toward simplifying or facilitating 
solution. Government cryptanalysts are sometimes fortunately situated 
in this respect, especially where various agencies work in harmony. 

(3) Proper coordination of effort.. This Includes the organization 

of cryptanalytic personnel into harmonious, efficient teams of cooperating 
individuals. , - 



10 °P« cit ., p. 3°1* 
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(if) Under mental equipmerit he would also include the faculty of 
■being able to concentrate on a problem for rather long periods of time, 
without distraction, nervous irritability, end impatience. The strain 
under vhich cryptanalytic studies are necessarily conducted is quite 
severe and too long -continued application has the effect of draining 
nervous energy to an unwholesome degree, so that a word or two of caution 
may not here be out of place . ^ One should continue at work only so long 
as a peaceful, calm spirit prevails , whether the work is fruitful or not. 
But just as soon as the mind becomes wearied with the exertion, or just 
as soon as a feeling of hopelessness or mental fatigue intervenes, it is 
better to stop completely and turn to other activities, rest, or play. 

It is essential to remark that systematization and orderliness of work 
_ are aids in reducing nervous tension and irritability. On this account 
it is better to take the time to prepare the data carefully, rewrite the 
text if necessary, and so on, rather than work with slipshod, incomplete, 
or improperly arranged material. 

(5) A retentive memory is an important asset to cryptanalytic skill, 
especially in the solution of codes . The ability to remember individual 
groups, their approximate locations in other messages, the associations 
they form with other groups, their peculiarities and similarities, saves 
much wear and tear of the mental machinery, as well as much time in 
looking up these groups in indexes.' 

(6) The assistance of machine aids in cryptanalysis. The importance 
and value of these aids cannot be overemphasized in their bearing on prac- 
tical, operational cryptanalysis, especially in the large-scale effort 
that would be made in time of war on complex, high-grade cryptosystems at 
a theater headquarters or in the zone of the interior. These aids, under 
the general category of rapid analytical machines, comprise both punched- 
card tabulating machinery and certain other general- and special-purpose 
high-speed electrical and electronic devices. Some of the more compact 
equipment may be employed by lower echelons within a theater of operations 
to facilitate the cryptanalysis - of medium-grade cryptosystems found in 
tactical communications. 



g. It may be advisable to add a word or two at this point to prepare 
the student to expect slight mental jars and tensions which will almost 
inevitably come to him in the conscientious study of this and the sub- 
sequent texts. The present author is well aware of the complaint of 
students that authors of texts on cryptanalysis base much of their expla- 
nation upon their foreknowledge of the "answer” — which the student does 
not know while he is attempting to follow the solution with an unbiased 
mind. They complain, too, that these authors use such expressions as "it 
is obvious that", "naturally", "of course", "it is evident that", and 
so on, when the circumstances seem not at all to warrant their use. 

There is no question that this sort of treatment is apt to discourage 
the student, especially when the point elucidated becomes clear to him 
only after many hours’ labor, whereas, according to the book, the author 
noted the weak spot at the first moment’s inspection. The present 
author can only promise to try to avoid making the steps appear to be 
much more simple than they really are, and to suppress glaring instances 
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of unjustifiable “jumping at conclusions". At the same time he must 
indicate that for pedagogical reasons in many cases a message has been 
consciously "manipulated" so as to allow certain principles to become 
more obvious in the illustrative examples than they ever are in practical 
work. During the course of some of the explanations attention will even 
be directed to cases of unjustified inferences. Furthermore, of the 
student who is quick in observation and deduction, the author will only 
ask that he bear in mind that if the elucidation of certain principles 
seems prolix and occupies more space than necessary, this is occasioned 
by the author's desire to carry the explanation forward in very short, 
easily-comprehended, and plainly -described steps, for the benefit of 
students who are perhaps a bit slower to grasp but who, once they under- 
stand, are able to retain and apply principles slowly learned just as 
well, if not better than the students who learn more quickly .H 

3. Validity of results of cryptanalysis .— Valid or authentic 
cryptanalytic solutions cannot and do not represent "opinions" of the 
cryptanalyst. They are valid only so far as they are wholly objective, 
and are susceptible of demonstration and proof, employing authentic, 
objective methods . It should hardly be necessary (but an attitude fre- 
quently encountered among laymen makes it advisable) to indicate that 
the validity of the results achieved by any serious cryptanalytic studies 
on authentic material rests upon the same sure foundations and are 
reached by the same general steps as the results achieved by any other 
scientific studies; viz. , observation, hypothesis, deduction and induction 
and confirmatory experiment. Implied in the latter is the possibility 
that two or more qualified investigators, each working independently 
upon the same material, will achieve identical (or practically identical) 
results — there is one and only one (valid) solution to a cryptogram. 
Occasionally a "would-be" or pseudo-cryptanalyst offers "solutions" which 
cannot withstand such tests; a second, unbiased, investigator working 
independently either cannot consistently apply the methods alleged to 
have been applied by the pseudo-cryptanalyst, or'eise,’if he can apply 



H In connection with the use of the word "obvious", the following 
extract is of interest: 

"How the word 'obvious 1 is a rather dangerous one. There is an 
incident, \7hieh has become something of a legend in mathematical circles, 
that illustrates this danger. A certain famous mathematician was lec- 
turing to a group of students and had occasion to use a formula which he 
wrote down with the remark, 'This statement is obvious.' Then he paused 
and looked rather hesitantly at the formula. 'Wait a moment, ' he said. 

'In it obvJous7 I think it’s obvious.' More hesitation, and then, 

'Pardon me, gentlemen, I shall return.' Then he left the room. Thirby- 
f ive minul.ee later he returned; in his hands was a sheaf of papers 
covered with calculations, on his face a look of quiet satisfaction. 

'I was right, gentlemen. It is obvious,' he said, and proceeded with his 
lecture." — Excerpt from The Anatomy of Mathematics by Kershner and Wilcox. 
Hew York, 1950. 
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them at all, the results (plaintext translations) are far different in 
the two cases . The reason for this "is that in such cases it is generally 
found that the "methods" are not clear-cut, straightforward or mathema- 
tical in character. Instead, they often involve the making of judgments 
on matters too tenuous to measure, weigh, "or otherwise subject to careful 
scrutiny. Often, too, they involve the "correction" of an inordinate 
number of "errors" which the pseudo -cryptanalyst assumes to he present and 
which he "corrects" in order to make his "solution" intelligible. And 
sometimes the pseudo-cryptanalyst offers as a "solution" plain text which 
is intelligible only to him or which he makes intelligible hy expanding 
what he alleges to be abbreviations, and so on. In all such cases, the 
conclusion to which the unprejudiced observer is forced to come is that 
the alleged "solution" obtained by the pseudo-cryptanalyst is purely 
subjective j n nearly all cases where this has happened (and they occur 
from time to time) there has been uncovered nothing which can in any way 



12 A mathematician is often tumble to grasp the concept behind the ex- 
pression "subjective solution" as used in the cryptanalytic field, since 
the idea is foreign to the basic philosophy of mathematics and thus the 
expression appears to him to represent a contradiction in terms. As an 
illustration, let us consider a' situation in which a would-be cryptanalyst 
offers a solution to a cryptogram he alleges to be a simple monoalphabetic 
substitution cipher. His so-called solution, however, requires that he 
assume the presence of, let us say," approximately 50 $ garbles (which he 
claims to have been introduced by cipher clerks’ errors, faulty radio 
reception because of adverse weather conditions, etc.). That is, the 
"plain text" he offers as the "solution" involves his making helter- 
skelter many "corrections and "emendations" , which, one may he sure, will 
be based on what his subconscious mind expects or desires to find in the 
cleartext message. Unfortunately, another would-be cryptanalyst working 
upon the same cryptogram and hypothesis independently might conceivably 
"degarble" the cryptogram in different spots and produce an entirely 
dissimilar "plain jbext" as his "solution". Both "solutions" would he 
invalid because they are based upon an erroneous hypothesis — the crypto- 
gram actually happens to he a polyalphabetic substitution cipher which 
when correctly analyzed requires on the part of unbiased observers no 
assumption of garbles to a degree that Btrains their credulity. The 
last phrase is added here because in professional cryptanalytic work it 
is very often necessary to make a few corrections for errors but it is 
rarely the case that the garble rate exceeds more than a few percent of 
the characters of thji c^^ptogram, say 5 to 10$ at the outside. It is 
to he noted, however, that occasionally the solution to a cryptogram 
may involve the correction of more than this percentage of errors, but 
the solution would be regarded as valid only if the errors can he shown 
to be systematic in some significant respect, or can otherwise be 
explained by objective rationalization. 
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"be used to impugn the integrity of the pse«do-crypt£ ualyst . The worst 
that can he said of him ic that La has 'become a victim of a special or 
peculiar form of self-delusion, and that his desire to solve the problem, 
usually in accord with some previously-formed opinion, or notion, has 
over-balanced, or undermined, his judgment and good sense .13 



V 



3 



1895. 



Los 



-*-3 Specific reference can be made to the following typical "case 
histories" : 

Donnelly, Ignatius, The Great Cryptogram . Chicago, 1888. 

Owen, Orville W., Sir Francis Bacon's Cipher Story . Detroit, 

Gallup, Elizabeth Wells, Francis Bacon's Biliteral Cipher , 

Detroit, 1900 . 

Arensbcrg, "Walter Conrad, The Cryptography of Shakespeare . 

Angeles, 1922. ~ " 

The Shakespearean Mystery . Pittsburgh, 1928 . 

The Baconian Keys . Pittsburgh, 1928. 

Margolioutb, D. S., The Homer of Aristotle . Oxford, 1923* 

^Newbold, William Romaine, The Cipher of Roger Bacon . Philadelphia, 
1928. : (For a scholarly and complete demolition of Professor 
Newbo'ld''s work, see an article entitled Roger Bacon and the 
Voynich 'MS, by John M. Manly, in Speculum, Vol. VI, No. 3, 

' July "1931.) 

/ Feely, JoBeph Martin, The Shakespearean Cypher. Rochester, N* Y. , 

A 1931. 

' Deciphering Shakespeare . Rochester, N. Y., 193 1 *- • 

^ Roger Bacon’s Cypher: the right key found . Rochester, N. Y., 19^3- 
Wolff, Werner, Dechiffrement de l^criture Maya . Paris., ‘1938. 

/Strong, Leonell C., Anthony Askham, the author of the Voynich 
V. manuscript , in Science , Vol. 101, June 15, 19^9 » PP» 608-9. 
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SECTION II 

A- “ "Tr- - *■ * 

BASIC CRYPTOLOGIC CONSIDERATIONS 



Cryptology | communication Intelligence, and communication 

security, * . ..*»..**.*».*•*• 

Secret communication . ... ...o, . . 

Plain text and encrypted teat. •*•«••*••••«•••*»... 
Cryptography, encrypting, and decrypting. .............. ....... 

Codes, ciphers, and enciphered code. .»*»•».*.*•**• 

General system, specific Isay, and cryptosystem...*..#..*..*#.# 
Cryptanalytica and cryptanalysis,. ....... ....•**••.»••••**..** 

Transposition and substitution. 

Nature of alphabets. 

Types of alphabets....... 
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4. Cryptology, communication intelligence, and communicatio n 
security . The occasional' or frequent need for secrecy in the conduct of 
important affairs has been a*ecogni!ied from time immemorial. In the case 
of diplomacy and organized irarfare this need is especially important in 
regard to communications. However, -when such communications are trans- 
mitted by electrical means, they can be heard and copied by unauthorized 
persons * The protection resulting from all measures designed to deny 
to unauthorized persons information of value -which may be derived from 
such communications is called communication security . The evaluated 
information concerning the enemy, derived principally from a study of his 
electrical communications , :Ls called communication intelligence . The coin 
lective term including all phases of communication intelligence and com- 
munication security is cryptology . 1 Or, stated in broad terms, cryptology 
is that branch of knowledge which treats of hidden, disguised, or secret^ 
communications* 



- 

± From the Greek kryptos (hidden) + logos (learning). The prefix 
"crypto-” in compound words pertains to H cryptologic " , " cryptographic" , 

Or "cryptanalytic", depending upon the use of the particular .word as 
defined* 

2 In this text the term ^secret" will be used in its ordinary sense 
as given in the dictionary. Whenever the designation is used in the more 
restricted sense of the security classification as defined in official 
regulations, it will be capitalized. There are in current use the four 
classifications Restricted, Confidential, Secret, and Top Secret, listed . 
in ascending order of degree* 
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5. Secret comunication .--a. Communication may "foe conducted by 
any means susceptible of' ultima t e Interpretation by one of the five 
senses, but those most commonly used are sight and hearing. Aside from 
the use of simple visual and auditory signals for communication over 
relatively short distances, the usual method of communication between 
or among individuals separated from one another hy relatively long dis- 
tances involves, at one stage or another, the act of writing or of 
speaking over a telephone. 

b. Privacy or secrecy in communication hy telephone can be obtained 
by using equipment which affects the electrical currents involved in te- 
lephony so that the conversations can he understood only by persons pro- 
vided with suitable equipment properly arranged for the purpose. The 
same thing is true in the case of electrical transmission of pictures, 
drawings, maps, and television images* However, this text will not treat 
of these aspectg3 of cryptology. 

c. Writing may be either visible or invisible . In the former, the 
characters are inscribed with ordinary writing materials and can be seen 
with the naked eye; in the latter, the characters are inscribed by means 
or methods which make the writing invisible to the naked eye. Invisible 
writing can be prepared with certain chemicals called Invisible, sympa- 
thetic, or secret inks, and in order to "develop" such writing, that is, 
make it visible, special processes must usually be applied. There are 
also methods of producing writing which is invisible to the naked eye 
because the characters are of microscopic size, thus requiring special 
photographic or microscopic apparatus to make such writing visible to the 
naked eye. 

d. Invisible writing and unintelligible visible writing constitute 
secreF writing . 

6. Plain text and encrypted text .— a. Visible writing which is 

intelligible, that is, conveys a more or less understandable or sensible 
meaning (in the language in which written) and which is not intended to 
convey a hidden meaning, is said to be in plain text .^ A message in 
plain text is termed a plaintext message, a cleartext message, or & 
message in clear . ~~~ 



3 These aspects of cryptology are now known as ciphony (from cipher + 
tele phony ): clfax (from cipher + facsi mile); and civision (from cipher + 
tele vision ) . ” *' " ~ 

^ Visible writing may be intelligible but the meaning it obviously 
conveys may not be its real meaning, that is, the meaning intended to be 
conveyed. To quote a simple example of an apparently innocent message 
containing a secret or hidden meaning, prepared with the intention of es- 
caping censorship, the sentence "Son born today" may mean "Three trans- 
ports left today." Messages of this type are said to be in open code . - 
Secret communication methods or artifices of this sort (concealment sys- 
tems) are impractical for field military use but are often encountered 
in espionage and counter-espionage activities. 
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b. Visible writing which conveys no intelligible meaning in any 
recognized language 5 ie said to be in encrypted text and such writing is 
termed a cryptogram.® ---- - -- 

Cryptography, enc r ypting, and decrypting . --a. Cryptography is 
that lorsnch of erotology l?iich treats" of Various means , methods ,and 
apparatus for converting or transforming plaintext messages into crypto- 
grams and for reconverting the cryptograms into their original plaintext 
forms by a simple reversal of the steps used in their transformation. 

b. To encrypt is to convert or transform a plaintext message into 

a cryptogramTyxollcjwing certain rules, steps, or processes constituting 
the key or keys and agreed upon in advance by correspondents, or furnished 
them" By- higKer authority. 

c. To decrypt is to reconvert or to transform a cryptogram into the 
original equivalent plaintext message by a direct reversal of the en- 
crypting process, that is, by applying' to the cryptogram the key or keys 
X usually In' a' reverse order) used in producing the cryptogram. 

d. A person skilled in the art of encrypting and decrypting, or one 
who has a part in devising a cryptographic system is called a crypto- 
grapher; a clerk who encrypts and decrypts, or who assists in such work, 
is called a cryptographic clerk . 

8. Codes, ciphers, and enciphered code .— a. Encrypting and de- 
crypting are accomplished by'means collectively" designated as codes and 
ciphers . Such means are used for either or both of two purposes; (l) se- 
crecy, and ( 2 ) economy or brevity. Secrecy usually is far more important 
in military cryptography tlian economy or brevity. In ciphers or cipher 
systems t cryptograms are pi’oduced by applying the cryptographic treatment 
to individual letters of the plaintext messages, whereas, in codes or code 
systems » cryptograms are produced by applying the cryptographic treatment 
to entire words, phrases, and sentences of the plaintext messages. The 
specialized' meanings' of the terms" co5e and cipher are explained in detail. _ 
later. 

b. A cryptogram produced by means of a cipher system is said to be 
in cipher and is called a cipher message, or sometimes simply a cipher , 
ttie act or operation of en'ixyp't'i'ng" a ’ cipher message is called enciphering. 



^ There is a certain typo of writing which is considered by its authors 
td be intelligible, but which is either completely unintelligible to the 
wide variety of readers or else requires considerable mental struggle on 
their part to make it intelligible. Reference is here made to so-called 
"modern literature" and "modem verse", products of such writers as 
E. E, Cummings, Gertrude Stein, James Joyce, et al. 

^ ^ om kryptos + gramma ' (that which is written). Analogous terminol- 
ogy would call a plaintext message a phanerogram (phaneros = visible, 
manifest, open). ” ” “ “ 
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and the enciphered version of the plain text, as ■well as the act or pro- 
cess Itself, is often referred to as the encipherment * The cryptographic 
clerk who performs the process serves as an encipherer . The corresponding 
terms applicable to the decrypting of cipher messages are deciphering, 
decipherment , and decipherer . A clerk who serves as both an encipherer 
anil decipherer of messages is called a cipher clerk . 

c. A cipher device is a relatively simple mechanical contrivance for 
eneipEerment and decipherment, usually "hand-operated" or manipulated by 
the fingers, as for example a device with concentric rings of alphabets, 
manually powered; a cipher machine is a relatively complex apparatus or 
mechanism for encipherment and decipherment, usually equipped with a 
typewriter key board and often requiring an external power source. 

d. A cryptogram produced by means of a code system is said to be in 
code and is called a code message . The text of the cryptogram is referred 
to as code text . This act or operation of encrypting, is called encoding, 
and the encoded version of the plain text, as well as the act or process 
itBelf , is referred to as the encodement . The clerk who performs the pro- 
cess serves as an encoder . The corresponding terms applicable to the 
decrypting of code messages are decoding , decodement, and decoder . A 
cryptographic clerk who serves as both an encoder and decoder of messages 
is called a code clerk . 

e. Sometimes, for special purposes (usually increased security), 
the code text of a cryptogram undergoes a further step in concealment 
involving superencryption , that is, encipherment of the characters com- 
prising the code text, thus producing what is called an enciphered-code 
message, or enciphered code . Encoded cipher, that is, the case' where' the 
final cryptogram is produced by enciphering the plain text and then en- 
coding the cipher text obtained from the first operation, is also possible, 
but rare. 

9. General system, specific key, and cryptosystem .— a. There are 
a great many different methods' of encrypting messages, so that correspon- 
dents must first of all be in complete agreement as to which of them will 
be used in their secret communications, or in different types or classes 
of such communications. Furthermore, it is to be understood that all the 
detailed rules, processes, or steps comprising the cryptography agreed 
upon will be invariant, that is, constant or unvarying in their use in a 
given set of communications. The totality of these basic, invariable 
rules, processes, or steps to be followed in encrypting a message according 
to the agreed method constitutes the general cryptographic system or, more 
briefly, the general system . 

b. It is usually the case that the general system operates in con- 
nection with or under the control of a number, a group of letters, a word, 
a phrase, or sentence which is used as a key, that is, the element which 
specifically governs the manner in which the general system will be applied 
in a specific message, or the exact setting of a cipher device or a cipher 
machine at the initial point of encipherment or decipherment of a specific 
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message. This element— usually of a variable nature or changeable at the 
will of the correspondents, or prearranged for them by higher authority— 
is called the specific key . The specific key may also involve the use of 
a set of specially prepared tables, a special document, or even a book. 

c. The tens cryptosystem? is used when it is desired to designate 
or refer to all the eryptomaterial (device, machine, instructions for use, 
key lists, etc.) as a unit to provide a single, complete system and means 
for secret communication. 

10. Cryptanalytics and cryptanalysis ."— a. In theory any cryptosystem 

(except one°) can be "broken”, l.e., BOlved, if enough time, labor, and 
skill are devoted to it, and if the volume of traffic in that system is 
large enough. This can be done even if the general system and the speci- 
fic key are unknown at the start. In military operations theoretical 
rules must usually give way to practical considerations. How the theo- 
retical rule in this case is affected by practical considerations will 
be discussed in Appendix 11. 

b. That branch of cryptology which dealB with the principles, 
methods, and means employed in the solution or analysis of cryptosystems 
is called cryptanalytic s . 

c# The steps and operations performed in applying the principles of 
cryptanalytics constitute c ryptanalysis . To cryptanalyze a cryptogram 
is to solve it by cryptanalysis. 

d. A person skilled in the art of cryptanalysis is called a crypt- 

analyst, and a clerk who assists in such work is called a cryptanalybic 
clerk . .. 

11. Transposition and substitution .— a. Technically there are only 
two distinct -types' of treatment' wEicK may Fe applied to written plain 
text to convert it into secret text, yielding two different classes of 
cryptograms. In the first, called transposition, the elements or units 
of the 'plain text retain their original identities and merely undergo 
some change in their relative positions, with the result that the original, 
text becomes unintelligible. In the second, called substitution, the 
elements of the plain text retain their original relative positions but 
are replaced by other elements with different values or meanings, with 
the result that the origin? si text becomes unintelligible . Thus, in the 
case of transposition ciphers, the 'unintelligibility is brought about 
merely by a change in the original sequence of the elements or units of 



? The term cryptosystem is used in preference to cryptographic system 
so as to permit its use in designating secret communication systems 
involving means other than writing, such as ciphony and cifax. 

® The exception is the 7one-tlme" system in which the key is used only 
once and in itself must have no systematic construction, derivation, or 
meaning. 
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the plain text; in the case of substitution ciphers, the unintelligi- 
bility is "brought about by a change in the elements or units themselves, 
•without a change in their relative order. 

b. It Is possible to encrypt a message by a substitution method and 
then to apply a transposition method to the substitution text, or vice 
versa. Such combined transposition-Bubstltution methods do not form a 
third class of methods. They are occasionally encountered in military 
cryptography, but the types of combinations that are sufficiently simple 
to be practicable for field use are very limited .9 

£. Under each of the two principal classes of cryptograms as out- 
lined~above, a further classification can be made based tip on the number - 
of characters composing the textual elements or units undergoing crypto- 
graphic treatment. These textual units are composed of (l) individual 
letters, (2) combinations of letters in regular groupings, (3) combina- 
tions of letters in irregular, more or less euphonious groupings called 
syllables, and (4) complete words, phrases, and sentences. Methods which 
deal with the first type of units are called monographic methods} those 
which deal with the second type are called polygraphic ( digraphic , tri- 
graphic, etc.); those which deal with the third type, or syllables, are 
called syllabic} and, finally, those which deal with the fourth type are 
called lexical (of or pertaining to words). 

d. It is necessary to indicate that the foregoing classification of 
cryptographic methods is more or less artificial in nature, and is estab- 
lished for purpose of convenience only. Ho sharp line of demarcation can 
be drawn In every case, for occasionally a given system may combine me- 
thods of treating single letters, regular or irregular-length groupings 
of letters, syllables, words, phrases, and complete sentences. When in 
a single system the cryptographic treatment is applied to textual units 
of regular length, usually monographic or digraphic (and seldom longer, 
or Intermixed monographic and digraphic), the system is called a cipher 
system . Likewise, when in a single system the cryptographic treatment is 
applied to textual units of irregular length, usually syllables, whole 
words, phrases, and sentences, and is only exceptionally applied to single 
letters or regular groupings of letters, the system is called a code 
system and generally involves the use of a code book . 3 - 0 

12. Hature of alphabets .—a. One of the simplest kinds of substitu- 
tion ciphers is that which is known, in cryptologic literature as Julius 
Caesar's Cipher, but Which, as a matter of fact, was a favorite long 
before his day. In this cipher each letter of the text; of a message is 
replaced by the letter standing the third to the right of it in the 



9 One notable exception is the ADFGVX system, used extensively by the 
Germans in World War I. See in this connection the Cryptographic 
Supplement (Appendix j), - • 

10 A list of single letters, frequent digraphs, tri graphs, syllables, 
and words is often called a syllabary} cryptographic treatment of the 
units of such syllabaries places them in the category of code systems. 



* 



RESTRICTED 



16 



REF ID:A56892 



RESTRICTEB - 

_ , - - *■ -f. ' r r-: : - r - s'*!'"-' or* - 7 -^'*-' o.^-. v j .x .1 

ordinary alphabet; the letter A is replaced by D, the letter B by E, 

and so on. The word cab becomes converted into FDE, which is cipher. 

b. The English langusage is written by means' of 2 6 single characters 
called letters which, taken together and considered as a sequence of 
symbols , constitute the alphabet of the language. Not all systems' 'of 
writing are of this nature. Chinese writing is composed of about 4^,000 
complex characters, each representing one sense of a word. Whereas 
English words are composite or polysyllabic and may consist of one to 
eight or, more syllables, Chinese words are all monosyllables and each 
monosyllable is e word. Written languages of the majority of other 
civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 

<c. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the complete history of which may 
never fully be known. 11 They are conventional symbols representing 
elementary sounds, and any other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally well. If taught from early childhood that the symbols 
*, and @ represent the sounds "Ay”, "Bee", and "See" respectively, the 
combination would still be pronounced cab , and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per- 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" HFG, pronounce it cab , and see nothing strange whatever in the 
matter. But to others no party to their arrangements, BFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those symbols and therefore pronounce HFG as cab ; but 
HFG is utterly unpronounceable anc* wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would be stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con- 
vention to represent a thing or an idea In the English language. Thus, 
it is seen that, in order for the Witten words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally understood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables; 
that is, combinations and permutations of elementary speech-sounds which 
have by long usage come to be adopted and recognized as representing 
definite thingB and ideas . Written plain language consists of words ; 
that is, combinations and permutations of simple symbols, c ailed Tester s , 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken language is composed. 



11 An excellent and most authoritative hook on this subject is The Al- 
phabet; a key to the history of Mankind by David Diringer. London,' 1^49 . 
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ordinary alphabet; the letter A is replaced by D, the letter B by E, 
and so on. The word cab becomes converted into IDE, which is cipher. 

b. The English language is written by moans of 2 6 single characters 
callecT letters which, taken together and considered as a sequence of 
symbols , constitute the alphabet of the language. Not all systems' of 
writing are of this nature. Chinese writing is composed of about 4V,000 
complex characters, each representing one sense of a word. Whereas , 
English words are composite or polysyllabic and may consist of one to 
eight or' more syllables, Chinese words are’ all monosyllables and each 
monosyllahle is s word. Written languages of the majority of other 
civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 

£. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the complete history of which may 
never fully be known. li - They are conventional symbols representing 
elementary sounds, and any other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally well. If taught from early childhood that the symbols 
$, *, and @ represent bhe sounds "Ay", "Bee", and "See" respectively, the 
combination @$* would still be pronounced cab , and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per- 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" HFG, pronounce it cab, and see nothing strange whatever in the 
matter. But to others no party to their arrangements, HFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those symbols and therefore pronounce HFG as cab ; but 
HFG is utterly unpronounceable anr* wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would be stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con- 
vention to represent a thing or an idea in the English language. Thus, 
it is seen that, in order for the written words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally under stood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables* 
that is, combinations and permutations of elementary speech-sounds which, 
have by long usage come to be adepted and recognized as representing 
definite things and ideas. Written plain language consists of words ; 
that is, combinations and permutations of simple symbols, called letters, 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken language is composed. 



11 An excellent and most authoritative book on this subject is The Al- 
phabet: a key to the history of Mankind by David Diringer. London, 19^9 . 
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ordinary alphabet; the letter A is replaced by D, the letter B by E, 
and so on. The word c ab becomes converted into IDE, which is cipher. 

b. The English language is written by means of 2 6 simple characters 
callecT letters which, taken together and considered as a sequence of 
symbols , constitute the alphabet of the language. Not all systems' of 
•writing are of this nature. Chinese writing is composed of about Wt-,000 
complex characters, each representing one sense of a word. Whereas . 
English words are composite or polysyllabic and may consist of one to 
eight or more syllables, Chinese words are all monosyllables and each 
monosyllable is e word. Written languages of the majority of other 
.civilized peoples of today are, however, alphabetic and polysyllabic in 
construction, so that the principles discussed here apply to all of them. 

c_. The letters comprising the English alphabet used today are the 
results of a long period of evolution, the complete history of which may 
never fully be known. 11 They are conventional symbols representing 
elementary sounds, and sny other simple symbols, so long as the sounds 
which they represent are agreed upon by those concerned, will serve the 
purpose equally veil. If taught from early childhood that the symbols 
$, *, and @ represent the sounds "Ay”, "Bee”, and "See” respectively, -the 
combination would still be pronounced cab , and would, of course, have 
exactly the same meaning as before. Again, let us suppose that two per- 
sons have agreed to change the sound values of the letters F, G, and H, 
and after long practice have become accustomed to pronouncing them as we 
pronounce the letters A, B, and C, respectively; they would then write 
the "word" BEG , pronounce it cab , and see nothing strange whatever in the 
matter. ButTTo others no party to their arrangements, EFG constitutes 
cipher. The combination of sounds called for by this combination of 
symbols is perfectly intelligible to the two who have adopted the new 
sound values for those symbols and therefore pronounce HFG as cab ; but 
BFG i3 utterly unpronounceable anr 1 wholly unintelligible to others who 
are reading it according to their own long-established system of sound 
and symbol equivalents. It would be stated that there is no such word 
as HFG, which would mean merely that the particular combination of sounds 
represented by this combination of letters has not been adopted by con- 
vention to represent a thing or an idea in the English language. Thus, 
it is seen that, in order for the written words of a language to be 
pronounceable and intelligible to all who speak that language, it is 
necessary, first, that the sound values of the letters or symbols be 
universally understood and agreed upon and, secondly, that the particular 
combination of sounds denoted by the letters should have been adopted to 
represent a thing or an idea. Spoken plain language consists of vocables; 
that is, combinations and permutations of elementary speech-sound's which.' 
have by long usage come to be adopted and recognized as representing 
definite things and ideas , Written plain language consists of words ; 
that is, combinations and permutations of simple symbols, called letters, 
which represent visually and call forth vocally the elementary speech- 
sounds of which the spoken , language is composed. 



11 An excellent and most authoritative book on this subject is The Al- 
phabet; a key to the history of Mankind by David Diringer. London," 1^9'. 
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d. It is clear also that in order to -write a polysyllabic language 
with facility it is necessary to establish and to maintain by common 
agreement or convention, equivalency between two sets of elements, first 
a set of elementary sounds and, second, a set of elementary symbols to 
represent the sounds. When this is done the result is what is called an 
alphabet , a word derived from the names of the first two letters of the 
Greek alphabet, "alpha" and "beta". 



t 



e. Theoretically, in an ideal alphabet each symbol or letter would 
denote only one elementary sound, and each elementary sound would inva- 
riably be represented by the same symbol. But such an alphabet would be 
far too difficult for the average person to use. It has been conserva- 
tively estimated that a minimum of 100 characters would be necessary for 
English alone. Attempts toward producing and introducing into usage a 
practical, scientific alphabet have been made, one being that of the 
Simplified Spelling Board in 1928, which advocated a revised alphabet of 
42 characters* Were such an alphabet adopted into current usage, in 
books, letters, telegrams, etc., the flexibility of cryptographic systems 
would be considerably extended and the difficulties set in the path of 
the enemy cryptanalysts greatly increased. The chances for its adoption 
in the near future are, however, quite small. Because of the continually 
changing nature of every living language, it is doubtful whether an 
initially "perfect alphabet" could, over any long period of time, remain 
so and serve to Indicate with great precision the exact sounds which It 
was originally designed to represent. 



13. Types of alphabets .— a. In the study of cryptography the dual 
nature of the alphabet become s~app arent . It consists of two parts or 
components, (l) an arbitrarily-arranged sequence of sounds, and (2) an 
arbitrarily-arranged sequence of symbols. 

b. The normal alphabet for any language is one in which these two 
components are" tke" ’ordinary” sequences that have been definitely fixed by 
long usage or convention. The dual nature of our normal or everyday 
alphabet is often lost sight of. When we write A, B, G, . . . we really 
mean: 

Sequence of sounds; "Ay" "Bee" "See" .... 

Sequence of symbols; A B C .... 

Normal alphabets of different languages vary considerably in the number 
of characters composing them and the arrangement or sequence of the 
characters. The English, Dutch, and German alphabets each have 2 6 ; the 
French. 25 j the Italian, 21 j the Spanish, 27 (including the digraphs CH 
and LL); and the Russian, 31.12 The Japanese language has a syllabary 
consisting of 72 syllabic sounds which require 48 characters for their 
representation. 



12 In contrast to the foregoing alphabets, it is of interest to note 
that in the Hawaiian language the alphabet consists of only 12 letters, 
viz , the five vowels A, E, I, 0, U, and the seven consonants H, K, L, 

M, N, P, W. 
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l4. The role of cryptanalysis in communication intelligence 
operations . — a. Through the medium of communication intelligence an at- 
tempt is made”to answer three questions concerning enemy communications: 
"Who?". "Where?" "What?" — Who are their 1 originators and addressees? 

Where are these originators and addressees located? What do the messages 
say? “ 

b. All of the foregoing questions are very important in the military 
application of communication intelligence. Hence, even though this text 
deals almost exclusively with the principles and operations involved in 
deriving the answer to the third question— "What do the messages say?” — 
a few words on the importance of the first and second questions may he 
useful. It is a serious mistake to think that one can necessarily and 
always correctly interpret the mere text of a message without identifying 
and locating the originator ana the addressee or, on many occasions, with- 
out having a background against which to interpret the message in order 
to appreciate its real import or significance. 

£. The very first step in the series of activities involved in de- 
riving communication intelligence is the collection of the raw material, 
that is, the interception 1 and copying of the transmissions constituting 
the messages to he studied and analyzed. 

d. Then, with the raw material in hand, studies are made in order 
to answer the first two questions— "Who?" and "Where?" The answers to 
these questions are not always obvious in modern military communications, 
especially in the case of messages exchanged by units in the combat zone, 
since messages of this sort rarely indicate in plain lan g ua g e who the 



1 intercept means, in its cryptologic sense, to gain possession of 
communications which are intended for other recipients, without obtaining 
the consent of these addressees and without preventing or ordinarily 
delaying the transmission of the communications to them. 
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originator and the addressee are or where they are located. Consequently, 
certain apparatus and techniques specifically developed for finding the 
answers to these questions must he employed. These apparatus and tech- 
niques are embraced by that part of communication intelligence theory and 
practice which is known as traffic analysis . This latter subject and 
interception are treated briefly in Appendix 10, "Communication intelli- 
gence operations” . (The serious student will derive much practical 
benefit from a careful reading of this appendix.) 

e. The foregoing operations, interception ahd traffic analysis, 
along"" with eryptanalys is constitute the first three operations of commu- 
nication intelligence. But generally there must follow at least one addi- 
tional operation. If the plain texts recovered through cryptanalysis are 
in a foreign language, they must usually be translated, and translation 
constitutes this fourth operation. In the course of translating, it may 
be found that, because of errors in transmission or reception, corrections 
and emendations must be made in these plain texts; however, although this 
often requires skill and experience of a high order, it does not consti- 
tute another communication intelligence operation, since it is but an 
auxiliary step to the process of translation. 

f . In a large-scale communication intelligence effort these four 
steps, interception, traffic analysis, cryptanalysis, and translation, 
must he properly organized and coordinated in order to gain the most bene- 
fit from the potentialities of communication intelligence, that is, the 
production of the maximum quantity of information from the raw traffic . 
This information must then be evaluated by properly trained intelligence 
specialists, collated with intelligence derived from other sources, and, 
finally, disseminated to the commanders who need the intelligence in time 
to be of operational use to them, rather than of mere historical interest. 
The foregoing operations and especially the first three — interception, 
traffic analysis, and cryptanalysis— usually complement one another. 

This, however. Is not the place for elaboration on the interrelationships 
which exist and which when properly integrated make the operations as a 
whole an efficient, Unified complex geared to the fulfillment of its prin- 
cipal goal, namely, the production of timely communication intelligence. 

g. With the foregoing general background, the student is prepared 

to proceed to the technical considerations and principle's of cryptanalysis , 

15. The four basic operations in cryptanalysis . — a. The solution of 
practically every cryptogram involves four fundamental operations or steps: 

(1) The determination of the language employed in the plaintext 
version. 

( 2 ) The determination of the general system of cryptography employed. 

(3) The reconstruction of the specific key in the case of a cipher 
system, or the reconstruction, partial or complete, of the code book, in 
the case of a code system; or both, in the case of an enciphered code 
system. 

(4) The reconstruction or establishment of the plain text. 
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b . These operatiors will "be taken up in the order in which they are 
given above and in which they usually are performed in the solution of 
cryptograms, although occasionally the second step may precede the firsts 

- 16. The determination of the language employed . — a. There is not 
much that need he said with respect to this operation except that the 
determination of the language employed seldom comes into question in the 
case of studies made of the cryptograms of an organized enemy. By this 
is meant that during wartime the enemy is of course known, and it follows, 
therefore, that the language he employs in his messages will almost cer- 
tainly he his native or mother tongue . Only occasionally nowadays is 
this rule broken. Formerly it often happened, or it might have indeed 
been the general rule, that the language used in diplomatic correspondence 
was not the mother tongue, hut French. In isolated instances during 
World War I the Germans used English when their own language could for 
one reason or another not he employed. For example, for a year or two 
before the entry of the United States into that war, during the time 
America was neutral and the German Government maintained its embassy in 
Washington, some of the messages exchanged between the Foreign Office in 
Berlin and the Embassy in Washington were encrypted in English, and a 
copy of the code used was deposited with the Department of State and our 
censor. Another instance is found in the case of certain Hindu conspira- 
tors who were associated with and partially financed by the German Govern- 
ment in 1915 and 1916 ; they employed English as the language of their 
cryptographic messages „ Occasionally the cryptograms of enemy agents may 
he in a language different from that of the enemy. But in general these 



1 Although the foregoing four steps represent the classical or ideal 
approach to crypt analys is, the art may he reduced to the following: 



Procedures tn cryptanalysis 

1. Arrangement and rearrangement of 
data to disclose non-random cha- 
racteristics or manifestations 
(i.e., in frequency counts, re- 
petitions, patterns, symmetrical 
phenomena , etc . ) . 

2. Recognition of the non -random 
characteristics or manifestations 
when disclosed. 



Requirements 

Experience or ingenuity, 
and time (which latter may 
he appreciably lowered by 
the use of machine aids in 
cryptanalysis) . 

Experience or statistics. 



3. Explanation of the non-random 
characteristics when recognized. 



Experience or imagination, 
and intelligence. 



In all of the foregoing, the element of luck plays a very important part, 
as it is possible to side-step a large amount of labor and effort, in 
many cases, if "hunches" or intuition lead the analyst forthwith to the 
right path. Therefore, the phrase "or luck" should he added to each of 
the requirements above. 

In fact, it all boils down to the simple statement: "Find something 

significant, and attach some significance thereto." 
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are, as has been said, isolated instances j as a rule, the language used 
in cryptograms exchanged between members of large organizations is the 
mother tongue of the correspondents. Where this is not the case, that is, 
when cryptograms of unknown origin must be studied, the cryptanalyst 
looks for any indications on the cryptograms themselves which may lead to 
a conclusion as to the language employed. Address, signature, and other 
data, If in plain text in the preamble, in the body, or at the end of the 
cryptogram, all come under careful scrutiny, as well as all extraneous' 
circumstances connected with the manner j.r which the cryptograms were 
obtained, the person on whom they were found, or the locale of', their 
origin and destination. 

b. In special cases, or under special circumstances a clue to the 
language employed is found in the nature and composition of the crypto- 
graphic text itself. For example, if the letters K and W are entirely 
absent or appear very rarely in messages, it may indicate that the lan- 
guage is Spanish, for these letters are absent in the alphabet of that 
language and are used only to spell foreign words or names . The presence 
of accented letters or letters marked with special signs of one sort or 
another, peculiar to certain languages, will sometimes indicate the lan- 
guage used. The Japanese Morse telegraph alphabet and the Russian Morse 
telegraph alphabet contain combinations of dots and dashes which are 
peculiar to those alphabets and thus the interception of messages con- 
taining these special Morse combinations at once indicates the language 
involved. Finally, there are certain peculiarities of alphabetic lan- 
guages which. In certain types of cryptograms, viz ., pure transposition, 
give clues as to the language used. For example, the frequent digraph CH, 
in German, leads to the presence, in cryptograms of the type mentioned, 

of many isolated C*s and H’s; if this is noted, the cryptogram may be 
assumed to be in German. 

c. In some cases it is perfectly possible to perform certain steps 
in cryptanalysis before the language of the cryptogram has been definitely 
determined. Frequency studies, for example, may be made and analytic 
processes performed without this knowledge, and by a cryptanalyst wholly 
unfamiliar with the language even if it has been identified, or who knows 
only enough about the language to enable him to recognize valid combina- 
tions of letters, syllables, or a few common words in that language. He 
may, after this, call to his assistance a translator who may not be a 
cryptanalyst but who can materially aid in making necessary assumptions 
based upon his special knowledge of the characteristics of the language 

in question. Thus, cooperation between cryptanalyst and translator 
results In solution.^ 



— The writer has seen in print statements that "during World War_I ^ . decoded messages in Japanese 

and Russian without knowing a word of either language." The ext it to which such statements are exaggerated 
will soon become obvious to the student. Of course, there are occasional instances m which a mere clerk with 
quite limited experience may be able to “solve” a message in an extremely simple system in a language of which 
lie has no knowledge at all; but such a "solution” calls for nothing more arduous than the ability to recognizo 
pronounceable combinations of vowels and consonants— an ability that hardly deceives to bo rated as “crjjyt- 
analytic" in any real sense. To say that it is possible to solve a cryptogram in a foreign language “without 
knowing a word of that language” is not quite the same as to say that it is possible to do so with only a slight 
knowledge of the language; and it may be stated without cavil that the better the cryptanalyst’s knowledge of 
the language, the greater arc the chances for his success and, in any case, the easier is his work. 
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17. The determination of the general system . — a. Except in the case 
of the more simple types of cryptograms, the step rof erred to as diagnosis , 
that is, ascertaining the general system according to which a given cryp- 
togram has been produced is usually a difficult, if not the most difficult, 
step in its solution. The reason for this is not hard to find. 

b. As will become apparent to* the student as he proceeds with his 
study, in the final analysis, the solution of every cryptogram, involvi 



a form of substitution depends upon its reduction to monoalphabctic terms. 



if it is not originally in those terms . This is true not only of ordinary 
substitution ciphers, but also of combined substitution -transposition 
ciphers, and of enciphered code. If the cryptogram must be reduced to 
monoalphabetic terms, the manner of its accomplishment is usually indi- 
cated by the cryptogram itself, by external or internal phenomena which 
become apparent to the cryptanalyst as he studies the cryptogram. If 
this is impossible, or too difficult, the cryptanalyst must, by one means 
or another, discover how to accomplish this reduction, by bringing to 
bear all the special or collateral information he can get from all the 
sources at his command. If both these possibilities fail him, there is 
little left but the long, tedious, and often fruitless process of elimi- 
nation. In the case of transposition ciphers of the more complex type, 
the discovery of the basic method is often simply a matter of long and 
tedious elimination of possibilities. For cryptanalysis has unfortunately 
not yet attained, and may indeed never attain, the precision found today 
in qualitative analysis in chemistry, for example, where the analytic 
process is absolutely clear-cut and exact in its dichotomy. A few words 
in explanation of what is meant may not be amiss. When a chemist seeks 
to determine the identity of an unknown substance, he applies certain 
specific reagents to the substance and in a specific sequence. The first 
reagent tells him definitely into which of two primary classes the unknown 
substance falls. He then applies a second test with another specific 
reagent, which tells him again quite definitely into which of two second- 
ary classes the unknown substance falls, and so on, until finally he has 
reduced the unknown substance to its simplest terms and has found out 
what it is. In striking contrast to this situation, cryptanalysis affords 
exceedingly few "reagents” or tests that may be applied to determine posi- 
tively that a given cipher belongs to one or the other ‘of two systems 
yielding externally similar results. And this is what makes the analysis 
of an isolated, complex cryptogram so difficult. Note the limiting adjec- 
tive "isolated” in the foregoing sentence, for it is vised advisedly. It 
is not often that the general system falls to disclose itself or cannot 
be discovered by painstaking investigation when there is a great volume 
of text accumulating from a regular traffic between numerous corre- 
spondents in a large organization. Sooner or later the system becomes 
known, either because of blunders and carelessness on the part of the 
personnel entrusted with the encrypting of the messages, or because the 
accumulation of text itself makes possible the determination of the 
general system by cryptanalytic, including statistical, studies . But in 
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the case of a single or even a few isolated cryptograms concerning which 
little or no information can he gained by the cryptanalyst, he is often 
unable, without a knowledge of, or a shrewd guess as to the general system 
employed, to decompose the heterogeneous text of the cryptogram into 
homogeneous, monoalphabet ic text, which is ‘rhe ■ultimate and essential 
step in analysis. The only knowledge that the cryptanalyst can bring to 
his aid in this most difficult step is that gained by long experience and 
practice in the analysis of many different types of systems . In this 
respect the practice of cryptanalysis is analogous to the practice of 
medicine: correct diagnosis is the most important and often the most 

difficult first step toward success. 

c. On account of the complexities surrounding this particular phase 
of cryptanalysis, and because in any scheme of analysis based upon suc- 
cessive eliminations of alternatives the cryptanalyst can only progress 

as far as the extent of his own knowledge of all the possible alternatives 
will permit, it is necessary that detailed discussion of the eliminative 
process be postponed until the student has covered most of the field. 

For example, the student will perhaps want to know at once how he can 
distinguish between a cryptogram that is in code or enciphered code from 
one that is in cipher. It is at this stage of his studies impracticable 
to give him any helpful indications on bis question. In return It may be 
asked of him why he should expect to be able to do this In the early 
stages of his studies when often the experienced expert cryptanalyst is 
baffled on the same score! 

d. Nevertheless, in lieu of more precise diagnostic tests not yet 
discovered, a general guide that may be useful in cryptanalysis will be 
built up, step by step as the student progresses, in the form of a series 
of charts comprising what may be designated An Analytical Key for Crypt- 
analysis . (See Section X.) It maybe of assistance to the studenb if, 
as he proceeds, he will carefully study the charts and note the place 
which the particular cipher he is solving occupies in the general crypt - 
analytic panorama. These charts admittedly constitute only very brief 
outlines, and can therefore be of but little direct assistance to him 

in the analysis of the more complex types of cryptosystems he may en- 
counter later on. So far as they go, however, they may be found to be 
quite useful in the study of elementary cryptanalysis. For the expe- 
rienced cryptanalyst they can serve only as a means of assuring that no 
possible step or process is inadvertently overlooked in attempts to solve 
a difficult cryptosystem. 

e. Much of the labor involved in cryptanalytic work, as referred 
to in par, 2, Is connected with this determination of the general system. 
The preparation of the text, its rewriting in different forms, sometimes 
being rewritten in dozens of ways, the recording of letters, the estab- 
lishment of frequencies of occurrences of letters, comparisons and 
experiments made with known material of similar character, and so on, 
constitute much labor that is most often indispensable, but which 
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sometimes turns out to have teen wholly unnecessary, or in vain. In one 
treatise^ it is stated quite boldly that "this work once done, the deter- 
mination of the system is often relatively easy." This statement can 
certainly apply only to the simpler types of cryptosystems} it is entirely 
misleading as regards the much more frequently encountered complex 
cryptograms of modern times. 

18. The reconstruction of the specific lcoy . — a.- Nearly all practi- 
cal cryptographic methods require the use of a specific key to guide, 
control, or modify the various steps under the general system. Once the 
latter has "been disclosed, discovered, or has otherwise come into the 
possession of the cryptanalyst, the next step in solution is to determine, 
if necessary and if possible, the specific key that was employed to en- 
crypt the message or messages under examination. This determination may 
not be in complete detail; it may go only so far as to lead to a know- 
ledge of the number of alphabets involved in a substitution cipher, or 
the number of columns involved in a transposition cipher, or that a one- 
part code has been used, in the case’ of a code system. But it is often 
desirable to determine the specific key in as complete a form and with 

as much detail as possible, for this information will very frequently be 
useful in the solution of subsequent cryptograms exchanged between the 
same correspondents, since the nature or source of the specific key in a 
solved ease may be expected to give clues to the specific key in an 
unsolved case . 

, • r - - 

b. Frequently, however, the reconstruction of the key is not a 
prerequisite to, and does not constitute an absolutely necessary preli- 
minary step in, the fourth basic operation, via ., the reconstruction or 
establishment of the plain text. In many cases, indeed, the two processes 
are carried along simultaneously, the one assisting the other, until in 
the final stages both 1 have been completed in their entireties. In still 
other cases the reconstruction of the specific key may follow the recon- 
struction of the plain -text instead of preceding it and is accomplished 
purely as a matter of academic interest; or the specific key may, in 
unusual cases, never 'be reconstructed. 

19. The reconstruction of the plain text . — a. Little need be said 
at this point on this phase of cryptanalysis . The process usually con- 
sists, in the case of substitution ciphers, in the establishment of 
equivalency between specific letters of the cipher text and the plain 
text, letter by letter, pair by pair, and so on, depending upon the par- 
ticular type of substitution system involved. In the case of transposi- 
tion ciphers, the process consists in rearranging the elements of the 
cipher text, letter by letter, pair by pair, or occasionally word by 
word, depending upon the particular type of transposition system involved, 
until the letters or words have been returned to their original plaintext 
order. In the case of code, the process consists in determining the 
meaning of each code group and inserting this meaning in. the code text to 
reestablish the original plain text. 



3 Lange et Soudart, op, 'cit ., p. 106. 
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b. The foregoing processes do not, as a rule, begin at the begin- 
ning of a message and continue letter by letter, or group by group in 
sequence. up to the very end of the message. The establishment of values 
of cipher letters in substitution methods, or of the positions to which 
cipher letters should be transferred to form the plain text in the case 
of transposition methods, comes at very irregular intervals in the pro- 
cess • At first only one or two values scattered here and there through- 
out the text may appear; these then form the "skeletons" of words, upon 
which further work, by a continuation of the reconstruction process, is 
made possible; in the end the complete or nearly complete^ - text is 
established. 

c. In the case of cryptograms in a foreign language, the transla- 

tion of the solved messages is a final and necessary step, but is not to 
be considered as a cryptanalytic process . However, it is commonly the 
case that the translation process will be carried on simultaneously with 
the cryptanalytic, and will aid the latter, especially when there are 
lacunae which my be filled in from the context. (See also subpar. l6c 
in this connection.) ” 

20. The utilization of traffic Intercepts . 5 — a. There are, of 
course, other operations which are not as basic in'" nature as those Just 
outlined but which must generally be performed as preliminary steps in 
practical cryptanalytic work (as distinguished from academic cryptana- 
lysis)". Before a military cryptanalyst can begin the analysis of an 
enemy cryptosystem, it is necessary for him to study the intercept mate- 
rial that is available to him, isolate the messages that h.ve been 
encrypted by means of the cryptosystem to be exploited, and to arrange 
the latter in a systematic order for analysis. This work, although 
apparently very simple, my require a great deal of time and effort. 

b. Since, whenever practicable, two or more intercept stations 
are assigned to copy traffic emanating from the stations of one enemy 
radio net, it is natural that there should be a certain amount of 
duplication in the work of the several stations. This is desirable 
since it provides the cryptanalysts with two or more sets of the same 
messages, so that when one intercept station fails to receive all the 
messages completely and correctly, because of radio difficulties, local 
static, or poor operation, it is possible by studying the other sets to 
reconstruct accurately the entire traffic of the enemy net. 

Sometimes in the case of code, the meaning of a small percentage 
of the code groups occurring in the traffic my be lacking, because 
there is insufficient text to establish their meaning. 

5 A traffic intercept is a copy of a 'communication gained through 
interception. 
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c , In all intercept activities where operators are used for copying 
the traffic, one of the most likely errors to he found is caused hy the 
human element in reception. For this reason cryptanalysts and their 
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A 


<• mm 


i, m, t, et 


S 


• • • 


h, d, i, r, u 


B 


mm « • 0 


d, ts 


T 


- 


a, e, n 


C 


mm * m 0 


f, k, r, nn 


U 




a, s, v, it 


D 


mm m m 


b, s, 1, ti 


mm 


0 0 0mm 


h, u, x, st 


E 


* 


t, i 


mm 


... 


a, m, o, r, u. 


F 


• •rnm 0 


r, in 


X 


mm 0 0 mm 


v, k, y, tu 


G 


mm mm • 


m, o, z, me 


Y 




x, c, nm 


H 


0 0 0 0 


s, v, b, ii, se 


Z 


MM • 0 


b, g, q, mi 


I 


0 • 


a, n, s 


1 


0 mm mm mm mm 


0, a 


J 


* at mmm 


w, o, an, eo 


2 


0 0 MMM 


1, 3 


K 


mm 0 mm 


d, o, ta 


3 


0 0 9 mm m 


2, k 


L 


0 m 0 0 


r, d, ed 


k 


0 0 0 0mm 


3, l 


M 


mm mm 


a, n, tt 


5 




h, 6 


N 


rnm 0 


i, m, t, te 


6 




•5, 7 


0 


mm mm mm 


g, k, w, mt 


T 


mm • • « 


6, 8 


P 


0 mm mm 0 


J, g, 1, w, an 


8 


MMM ft * 


7, § 


Q 


mm mm 0 mm 


o, x, z, ma 


2 


MMMM • 


8, 0 


R 


0 mm m 


a, f, g, 1, n, s, w 


<f> 




9, 1 



Chart 1. Most common errors in telegraphic transmission. 



assistants should be familiar with the international Morse alphabet and 
the most common errors in wire and radio transmission methods so as to he 
able to correct garbled groups when they occur. In this connection, 

Chart 1, above, will be found useful. 
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SECTION I V 

FREQUENCY DISTRIBUTIONS AND THEIR FUNDAMENTAL USES 



Paragraph 



The simple or uniliteral frequency distribution ...... 21 

Important features of the normal uniliteral frequency 

distribution. • 22 

Constancy of the standard or normal uniliteral frequency 

distribution 23 

The three facts which can be determined from a study of the 

uniliteral frequency distribution for a cryptogram 2k 

Determining the class to which a cipher belongs..*.................* 25 

Determining whether a substitution cipher is monoalphabetic or 

non-monoalphabet ic 26 

The 4> (phi) test for determining monoalphabetieity 27 

Determining whether a cipher alphabet is standard (direct 

or reversed) or mixed 28 



21. The simple or uniliteral frequency distribution . — a. It has 
long been known to cryptographers and typographers that the letters com- 
posing the words of any intelligible written text composed in any language 
which is alphabetic in construction are employed with greatly varying 
frequencies. For example, if on cross-section paper a simple tabulation, 
shown in Fig. 1, called a uniliteral frequency distribution, is made of 
the letters composing the words of the preceding sentence, the variation 
in frequency is strikingly demonstrated. It is seen that whereas certain 
letters, such as A, E, I, N, 0, R, and T, are employed very frequently, 
other letters, such as C, G, H, L, P, and S are employed not nearly so 
frequently, while still other letters, such as F, J, K, Q, V, X, and Z 
are employed either seldom or not at all. 



g 5 

A B 
u a 



C D E F 

8' 4 22 2 



G H I J 

9 10 16 0 



lls g g g 

5iggg~.gggS:-g- 

MNOPQRSTUVWX 

3 17 14 8 1 IS 10 20 3 1 S 1 



(Total =200 letters) 



Y Z 

7 0 



Figure 1. 



b. If a similar tabulation is now made of the letters comprising the 
words"~of the second sentence in the preceding subparagraph, the distribu- 
tion shown in Fig. 2 i3 obtained. Both sentences have exactly the same 
number of letters (200) . 
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ABCDEFGHIJKLMNOPQRSTUVWXYZ 



12 2 8 7357 4 C 30 0 1 9 6 17 14 6 213 14 17 6 1 2 1 3 0 

(Total =200 letter^) 

Figure 2. 

c. Although each of these two distributions exhibits great varia- 
tion in the relative frequencies with which different letters are employed 
in the respective sentences to’ which they apply, no marked differences 
are exhibited between the frequencies of the same letter in the two dis- 
tributions. Compare, for example, the frequencies of A, B, C . . . Z in 
Fig. 1 with those of A, B, C . . . Z in Fig. 2. Aside from one or two 
exceptions, as in the case of the letter F, these two distributions agree 

rather strikingly. 

<• 

d. This agreement, or similarity , would be practically complete if 
the two texts were much longer, for example, five times as long. In fact, 
when two texts of similar character, each containing more than 1,000 let- 
ters, are compared, it would be found that the respective frequencies of 
the 2 6 letters composing the two distributions show only very slight dif- 
ferences. This means, in other words, that in normal plain text each 
letter of the alphabet occurs with a rather constant or characteristic 
frequency which it tends to approximate, depending upon the length of the 
text analyzed. The longer the text (within certain limits), the closer 
will be the approximation to the characteristic frequencies of letters 

In the language Involved. However, when the amount of text being ana- 
lyzed has reached a substantial volume (roughly, 1,000 letters), the prac- 
tical gain in accuracy does not warrant further increase in the amount 
of texfc.-*- 

e. An experiment along these lines will be convincing. A series 
of 260 official telegrams 2 passing through the Department of the Army 
Message Center was examined statistically.' The messages were divided 
into five sets, each totaling 10,000 letters, and the five distributions 
shown in Table 1-A, were obtained. 

1 See footnote 5, page 38 . 

2 These comprised messages from several official sources in addition 
to the Department of the Army and were all of an administrative character. 
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Tablg 1-A. — Absolute frequencies of letters appearing in five sets of Governmental plain-text telegrams, 
each set containing 10,000 letters, arranged alphabetically 



Set No. 1 


Sot No. 2 


Set No. 8 


Set No. 4 


Sat No. 5 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Latter 


Absoluts 

Frequency 


A 


738 


A 




A. 


681 


A . 


740 


A. 


741 


B 


104 


R 




R 


98 


R 


83 


B 


99 


0 


819 


O 


300 


C 


288 


C 


826 


C 


301 


n 


887 


D 


418 


D 


423 


D 


461 


D 


448 


E 


1,867 


E 


1,294 


E. 


1,292 


E 


1,270 


E 


1,276 


F 


263 


F... 


287 


F. 


308 


F 


287 


F 


281 


fi 


166 


G 


176 


G 


161 


G .... 


167 


G 


160 


H 


310 


H 


361 


H 


835 


H. 


849 


H 


849 


I . 


742 


I 


760 


T 


787 


I 


700 


I 


697 


.T 


18 


J 


17 


.T 


10 


.1 


21 


J 


16 


K 


86 


K. 


38 


K 


22 


K_ 


21 


K 


81 


L.. . 


366 


L 


893 


L 


388 


L 


886 


L. 


844 


M 


242 


M . 


240 


M 


238 


M ... 


249 


M. 


268 


N 


786 


N 


794 


N . . 


815 


N 


800 


N 


780 


n 


686 


0. 


770 


O 


791 


0. 


766 


O 


762 


p 


241 


P. . 


272 


P 


817 


P 


245 


P 


260 


q 


40 


q 


22 


P 


46 


Q. ... 


38 


Q ....... 


80 


R 


760 


R 


746 


R 


762 


R 


786 


R . 


786 


s 


668 


s 


683 


S. 


586 


R 


628 


R 


604 


T 


936 


T 


879 


T... 


894 


T 


968 


T. ~ 


928 


m 


270 


II 


233 


U. 


312 


u! 


247 


II 


288 


wmmm 


163 


V 


173 


V 


142 


V. . 


133 


V. 


166 




166 


W. 


163 


w. _ . 


186 


w„ 


183 


W 1- 


182 




43 


X 


60 


X 


44 


X.. _ 


63 


X. . 


41 


H wmdS^Slmii 


191 


Y ...... 


166 


Y 


179 


Y_ 


213 


Y ........ 


229 


7 . 


14 


7. 


17 


7 . 


2 


7 , 


11 


7 . 


6 






















Total 


10,000 




10,000 




10,000 




10,000 

















f . If the five distributions in Table 1-A are summed, the results 
are as shown in Table 2-A. 



Table 2-A. 



-Absolute frequencies of tetters appearing in the combined five sets of messages totaling 
60,000 letters, arranged alphabetically 



A.... 


. 3,683 


G. 


819 


L 


1 ,82 f 


0 — 


175 


V. 


766 


B 


487 


H 


.. 1,694 


M_ 


1 ,237 


R 


3,788 


W. 


1780 


C.... 


. 1,534 


I 


.. 3,676 


N. 


3 ,975 


S. 


3,058 


X. 


231 


D„_. 


. 2, 122 


J 


82 


0 


3 ,764 


T 


4, 595 


Y. 


967 


E____ 
F 


. 6,498 
- 1,416 


K 


J48 


P. 


1 ,335 


U. 


1,300 


Z. 


49 
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g. The frequencies noted in Table 2-A above, when reduced to the 
basis of 1,000 letters and then v 'sed as a basis for constructing a simple 
chart that will exhibit the variations in frequency in a striking manner, 
yield the following distribution which is hereafter designated as the 
normal or standard uniliteral frequency distribution for English 
telegraphic plain text: 

g 



g 

g 




BCDEFGHIJKL 





QRSTUVWXYZ 



74 10 31 43 130 28 18 34 74 3 3 38 25 79 75 37 8 78 61 92 20 15 16 K 19 1 



Figure 3, 



22. Important features of the normal uniliteral frequency distri - 
bution . — a. When the distribution shown in Fig. 3 is studied in detail, 
the following features are apparent: 

(1) It is quite irregular in appearance. This is because the letters 
are used with greatly varying frequencies, as discussed in the preceding 
paragraph. This irregular appearance Is often described by saying that 
the distribution shows marked crests and troughs , that is, points of high 
frequency and low frequency. 

(2) The relative positions in which the crests and troughs fall 
within the distribution, that is, the spatial relations of the crests and 
troughs, are rather definitely fixed and are determined by circumstances 
which have been explained in subpar. 13b. 

(3) The relative heights and depths of the crests and troughs within 
the distribution, that is, the linear extens i ons of the lines marking the 
respective frequencies, are also rather, definitely fixed, as would ho 
found if an equal volume of similar text were analysed. 
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00 The most prominent crests are marked by the vowels A, E, X, 0, 
and the consonants IT, R, S, T; the most prominent troughs are marked by 
the consonants J, K, Q, X, and Z, 

(5) The important data are summarized in tabular form in Table 3 . 



Table 3 



* 


Frequency 


Percent of 
total 


Percent of 
total in 
round 
numbers 


6 Vowels: A E I 0 U Y. 


398 


39.8 


40 


20 Consonants: 

5 High Frequency (D N R S T) 


350 


35.0 


35 


10 Medium Frequency (BCFGHLMPV W) 


238 


23.8 


24 


5 Low Frequency ( J K Q X Z) 


14 


1.4 


1 


Total ... - - 


U 


■ 


100 





( 6 ) The frequencies of the letters of the alphabet, reduced to a 



base of 1000, are 
A 74 


as 

G... 


follows : 
ifi 


T. 


36 


Q_ 


3 


R 


10 


H... 


34 


M. 


25 


R 


76 


C 


.... 31 


I... 


74 


" N. 


79 


S 


61 


D... . 


.... 42 


J... 


2 


n 


75 


T 


92 


E 


.... 130 


K... 


3 


p 


27 


II 


28 


F 


.... 28 











V. 

W. 

X 

Y. 

Z 



15 

16 
5 

19 

1 



( 7 ) The relative order of frequency of the letters is as follows: 



E. 


130 


I 


... 74 


C 


31 


Y 


19 


X... 


T 


92 


S 


... 61 


F 


28 


G. 


16 


p 


N ^ 


79 


D ' _ 


... 42 


P 


27 


W 


16 


K. 


R 


76 


L 


... 36 


li 


26 


V. 


15 


.T 


0 


75 


H 


... 34 


M 


25 


B 


10 


7 


A 


74 













6 

3 

3 

2 

1 



- (8) The four vowels A, E, I, 0 (combined frequency 353) and the four 

consonants N, R, S, T (combined frequency 308) form 66l out of every 
1,000 letters of plain text; in other words, less than one-third of the 
alphabet is employed in writing two -thirds of normal plain text . 
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b. The data given in Fig. 3 and Table 3 represent the relative fre- 
quencies found in a large volume of English telegraphic text of a govern- 
mental, administrative character .3 These frequencies will vary somewhat 
with the nature of the text analyzed. For example, if an equal number 

of telegrams dealing solely with commercial transactions in the leather 
industry were studied statistically, the frequencies would "be slightly 
different because of the repeated occurrence of words peculiar to that 
industry. Again, if an equal number of telegrams dealing solely with 
military messages of a tactical character were studied statistically, 
the frequencies would differ slightly from those found above for general 
governmental messages of .an administrative character. 

c. If ordinary English literary text (such as may be found in any 

book, "newspaper, or printed document) were analyzed, the frequencies of 
certain letters would be changed to an appreciable degree. This is 
because in telegraphic text words which are not strictly essential for 
intelligibility (such as the definite and indefinite articles, certain 
prepositions, conjunctions, and pronouns) are omitted. In addition, 
certain essential words, such as "stop”, "period", "comma", and the like, 
which are usually indicated in written or printed matter by symbols not 
easy to transmit telegraphically and which must, therefore, he spelled 
out in telegrams, occur very frequently. Furthermore, telegraphic text 
often employs longer and more uncommon words than does ordinary newspaper 
or book texb, ' • • £ - 

d. As a matter of fact, other tables compiled from Army sources 
gave slightly different results, depending upon the source of the text. 
For example, three tables based upon 75,000, 100,000, and 136,257 letters 
token from various sources (telegrams, newspapers, magazine articles, 
books of fiction) gave as the relative order of frequency for the first 
10 letters the following: 

For 75,000 letters ETRNIOASDL 

For 100,000 letters ETSIHOASDL 

For 136,257 letters ETRNAOISLD 



3 Just as the individual letters constituting a large volume of plain 
text have more or less characteristic or fixed frequencies, so it is 
found that di graphs and trlgrapho (two- and three -letter combinations, 
respectively)' have characteristic frequencies, when a large volume of 
text is studied statistically. In Table 6 of Appendix 2, "Letter fre- 
quency data - English", are sheem the relative frequencies of all digraphs 
appearing in the 260 telegrams referred to in subpar. 21e. This appendix 
also Includes several other kinds of tables and lists of - frequency data 
which will be useful to the student in his work. It is suggested that 
the student refer to this appendix now, to gain an idea of the data 
available for his future reference. 

Other languages, of course, each have their own individual charac- 
teristic plaintext frequencies of single letters, digraphs, trigraphs, 
etc. A brief sunusary of the letter frequency data for German, French, 

I! ■J.ion, Spanish, Portuguese, and Russian constitute Appendix 5, "Letter 
fr iquoncy data - foreign languages". 
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e. Frequency data applicable purely to English military text were 
compiled by Hitt,^ from a study of 10,000 letters taken from orders and 
reports. The frequencies found by him are given in Tables 4 and 5* 

- - Table 4. — Frequency table for 10,000 letters of literary English , as compiled by Hitt 



ALPHABETICALLY ARRANGED 



A 


. - 778 


G. 


174 


L 


872 


9 ' 


8 ' 


V. 


i 12 


B. . 


141 


H 


595 


M... 


2RR 


R. 


651 


W 


176 


C.u. 


. 296 


L 


667 


N. 


686 


S. 


622 


X 


27 


D.„. 


. 402 


J 


51 


O 


807 


T. 


855 


Y. 


196 


E 


. 1 ,277 


K. 


74 


P 


223 


II 


308 


7. 


17 


F 


. T 197 















1 




ARRANGED ACCORDING TO FREQUENCY 



E — 


l ,277 


R 


651 


U. 


308 


¥ ’ 


.. 196 


K. 


74 


T — 


855 


S. 


622 




296 


W 


.. 176 


.1 


... 51 


0 


807 


H. 


595 


M. 


288 


a 


.. 174 


X 


.... 27 


A 


778 


D 


402 


P 


223 


B 


„ 141 


7 


17 


N 


686 


L 


372 


P 


197 


V 


.. 112 


O 


8 


I 


667 















Table ^—Frequency table for 10,000 letters of telegraphic English, as compiled by Hitt 



ALPHABETICALLY' ARRANGED 



A... 


.... 813 


G. 


201 


L. 


... 392 


Q..... 


38 


V... 


136 


a.. 


149 


h. :. 


386 


M 


... 273 


R..... 


CL*rry 

‘m—m Wl 1 


W... 


166 


c_ 


... 306 


I 


711 


N. 


... 7J8 


S..„. 


656 


X... 


61 


D... 


417 


J ... 


42 


0 


... 844 


T 


634 


Y... 


208 


E... 


.... 1,319 


K. 


88 


P 


... 243 


U..... 


321 


Z... 


6 



F 205 



ARRANGED ACCORDING TO FREQUENCY 



E... .. 


1 ,319 


S. 


... 656 


U. 


321 


F. 


... 205 


KL 


... 88 


0 


844 


T. 


... 634 


C 


306 


G. 


... 201 


X. 


„ 51 


A 


813 


D. 


... 417 


NL 


273 


W...... 


... 166 


J, 


... 42 


N 


718 


L - 


... 392 


P 


243 


B 


... 149 


q 


. 38 


I 


711 


H. 


... 386 


' Y 


208 


V...... 


... 136 


Z. 


... 6 


R. 


677 






















- 






- 


• 




- 


- 



23. Constancy of the standard or normal uniliteral frequency dis - 
tribution , --a. The relative frequencies disclosed by the statistical 
study of large volumes of text may be considered to be the standard or 
normal frequencies of the letters of written English. Counts made of 
smaller volumes of text will xend to approximate these normal frequencies, 



Gp. clt ., pp. 6-7. 
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and, within certain limits, 5 the smaller the volume, the lower will' he 
the degree of approximation to the normal, until, in the case of a very- 
short message, the normal proportions may not manifest themselves at all. 
It is advisable that the student fix this fact firmly in mind, for ‘the 
sooner he realizes the true nature of any data relative to the frequency 
of occurrence of letters in text, the less often will his labors toward 
the solution of specific ciphers he thwarted and retarded by too strict 
an adherence to these generalized principles of frequency. He should 
constantly bear in mind that such data are merely statistical generaliza- 
tions, that they will be found to hold strictly true only in large Volumes 
of text, and that they may not even be approximated in short messages. 

b. Nevertheless the normal frequency distribution or the “normal 
expectation" for any alphabetic language is, in the last analysis, the 
best guide to, and the usual basis for, the solution of cryptograms of a 
certain type. It is useful, therefore, to reduce the normal, uniliteral 
frequency distribution to a basis that more or less closely approximates 
the volume of text which the cryptanalyst most often encounters in indi- 
vidual cryptograms. As regards length of messages, counting only the 
letters in the body, and excluding address and signature, a study of 
the 260 telegrams referred to in par. 21 shows that the arithmetical 
average is 217 letters; the statistical mean, or weighted average®, 
however, is 191 letters. These two results are, however, close enough 
together to warrant the statement that the average length of telegrams is 
approximately 200 letters. The frequencies given in par. 21 have there- 
fore been reduced to a basis of 200 letters, and the following unilite- 
ral frequency distribution may be taken as showing the most typical 
distribution to be expected in 200 letters of English telegraphic text:. 



ABCDEFGHIJ 



§ 2 = =: §= 

= II g g g 

KLMNOPQRSTUVWXYZ 
Figure 4. 



^It is useless to go beyond a certain limit in establishing the normal-frequency distribution for a given 

language. As a striking instance of this faot, witness the frequency study made by an indefatigable German, 
Kaeding, who in 1898 made a count of the letters in about 11,000,000 words, totaling about 62,000,000 letters in 
German text. When reduced to a percentage basis, and when the relative order of frequency was determined, 
the results he obtained differed very little from the results obtained by Kasiski, a German cryptographer, from a 
count of only 1,000 letters. See Kaeding, Haeufigheilswoerterbueh, Stegiitz, 1898; Kasiski, Die Geheimschriften 
und die Dcchijjft ir-Kunst, Berlin, 1863. 

l 

° The arithmetical average is obtained by adding each different length 
and dividing by the number of different -length messages; the mean is ob- 
tained by multiplying each different length by the number of messages of 
that length, adding all products, and dividing by the total number of 
messages . 
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c. The student should take careful note of the appearance of the 
distribution? shown in Fig, 4, for it will he of much assistance to him 
in the early stages of his study. The manner of Betting down the tallies 
should he followed hy him in* making his ora distributions, indicating 
every fifth occurrence of a letter hy an oblique tally. This procedure 
almost automatically shows the total number of occurrences for each let- 
ter, and yet does not destroy the graphical appearance of the distribu- 
tion, especially if care is taken to use approximately the same amount 

of space for each set of five tallies. Cross-section paper is very 
useful for this purpose. 

d. The word "uniliteral" in the designation "uniliteral frequency 
dis fcribution" means “single” letter", and it is to be inferred that other 
types of frequency distributions may be encountered. For example, a 
distribution of pairs of letters, constituting a biliteral frequency 
distribution, is very often used in the study of certain cryptograms in 
which it is desired that pairs made by combining successive letters be 
listed. A biliteral distribution of A B C D E F would take these pairs: 
AB, BC, CD, DE, EF. The distribution could be made in the form of a 
large square divided up into 676 cells. When distributions beyond bi- 
literal are required (tri Literal, quadriliteral, etc.) they can only be 
made by listing them in some order, for example, alphabetically based on 
the 1st, 2d, 3d, . . * letter. 



? The use of the terms "distribution" and "frequency distribution", 
instead of "table" and "frequency table", respectively, is considered 
advisable from the point of view of consistency with the usual statistical 
nomenclature . When data are given in tabular form, with frequencies 
indicated by numbers, then they may properly be said to be set out in 
the form of a table . When, however, the same data are distributed in a 
chart which partakes of the nature of a graph, with the data indicated 
by horizontal or vertical linear extensions, or by a curve connecting 
points corresponding to quantities, then it is more proper to call such 
a graphic representation of the data a distribution. 
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2k. The three facts which can be determined from a study of the 
tmllitoral frequency distribution for a cryptogram *" -a. The following 
threo facts' (to he explained subsequently)' can usually he determined from 
an inspection of the uniliteral frequency distribution for a given cipher 
message of average length, composed of letters: 

(1) Whether the cipher belongs to the substitution or the transpo- 
sition class: s 

(2) If to the former, whether it is monoalphabetie 8 or non- 
monoalphabetic^ in character j 

(3) If monoalphabetie , whether the cipher alphabet is standard 

(direct or reversed) or mixed, * 

b. For immediate purposes the first two of the foregoing determi- 
nations are quite important and will be discussed in detail in the next 
tiro paragraphs j the other determination will be touched upon very briefly, 
leaving its detailed discussion for subsequent sections of the text, 

25, Determining the class to which a cipher belong s,— a. The deter- 
mination of the class to which a cipher belongs is usually a relatively 
easy matter because of the fundamental difference between transposition, 
and substitution as cryptographic processes. In a transposition cipher 
the original letters of the plain text have merely been rearranged, 
without any change whatsoever in their identities, that is, in the 
conventional values they have in the normal alphabet. Hence, the nuidbers 
of vowels (A, E, I, 0, U, Y) , high-frequency consonants (D, H, R, S, T), 
medium-frequency consonants (B, C, F, G-, H, L, M, P, V, W), and low- 
frequency consonants (J, K, Q, X, Z) are exactly the same in the 
cryptogram as they are in the plaintext message. Therefore, the 
percentages of vowels, high-, medium-, and low-frequency consonants are 
the some in the transposed text as in the equivalent plain text. In a 

8 In" connection with uniliteral frequency distributions, the term 
monoalphabetie is considered to embrace the concept of monoalphabetie - 
monographic -uniliteral systems only, thus excluding polygraphic and 
rjultilitox'al systems, both of which, however, usually fall into the 
monoalphabetie category, 

9 The term non-monoalphabet ic as applied in this instance is consi- 
dered to embrace all deviations from the characteristic appearance of 

1 monoalphabetie distributions , These deviations include the phenomena 
inherent in polyalphabetic, polygraphic, and multiliteral cryptograms, 
as well as in random text, i.e,, text which appears to have been pro- 
duced by chance or accident, having no discernible patterns or 
limitations , 
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substitution cLphcx*, on the other hand, the identities of the original 
Icbtero ox the plain tc:cfc liavo been changed, that is, the conventional 
values they have in the normal alphabet have been* altered. Consequently, 
if a count is mode of the various letters present in such a cryptogram, 
it -will he found that the number of vowels, high-, medium-, and low- 
frequcncy consonants will usually he quite different in the cryptogram 
Crow 'what they are in the original plaintext message. Therefore, the 
Percentages of vowels , high-, medium-, and loir-frequency consonants are 
usually quite different in the substitution text from what they are in 
the equivalent plain text. From these considerations it -follows that if 
in a specific cryptogram the percentages of vowels, high-, medium-, and 
low-frequency consonants are approximately the some as would he expected 
in normal plain text, the cryptogram prohahly belongs to the transposition 
class; if these percentages are quite different from those to he expected 
in normal plain text the cryptogram prohahly belongs to the substitution 
class . 

h. In the preceding subparagraph the 'word "probably” was emphasized 
by italicizing it, for there can be no certainty in every case of this 
determination. Usually these percentages in a transposition cipher are 
close to the normal percentages for plain text; usually , in a substitu- 
tion cipher, they are far different from the normal percentages for plain 
text. But occasionally a cipher message is encountered which is difficult 
to classify with a reasonable degree of certainty because the message is 
too' short for the general principles of frequency to manifest themselves . 
It is clear that if in actual messages there were no variation whatever 
f’-on the normal vowel and consonant percentages given in Tab.l e 3, the 
determination of the class to which a specific cryptogram belongs would 
be an extremely simple matter. But unfortunately there is always some 
variation or deviation from the normal. Intuition suggests that as 
messages decrease in length there may be a greater and greater departure 
from the normal proportions of vowels , high-, medium-, and loir-frequency 
consonants, until in’ very short messages the normal proportions may not 
hold nt all." Similarly, as messages increase in length there may be a 
lesser and lesser departure from the normal proportions, until in messages 
totalling a thousand or more letters there may be no difference at all 
between the actual and the theoretical proportions. But intuition is not 
enough, for in dealing vith specific messages of the length of those 
commonly encountered in practical work the question sometimes arises as 
to exactly how much deviation (from the normal proportions) may be allowed 
for in a cryptogram which shears a considerable amount of deviation from 
the normal and which might still belong to the transposition rather than 
to the substitution class. 

c. Statistical studies have been made on this matter and some graphs 
have been constructed thereon. These are shown in Charts 2 - 5 in. the 
form of simple curves, the use of which will now be explained. Each 
chart contains tiro curves marking the lower and upper limits, respect - 
ivolv. of the theoretical amount of deviation (from the normal percent- 
ages) of vowels or consonants irhich may be allowable in a cipher believed 
Lo belong to the transposition class. 
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d, In Chart 2, curve marks the lover limit of the theoretical 
amount of deviation 3 - 0 from the number of vowels theoretically expected to 
appear 3 - 3 - in a message of given length; curve Vg marks the upper limit 

of the same statistic. Thus, for example, in a message of 100 letters 
in plain English there should he between 33 and 47 vowels (A E I 0 U l), 
Likewise, in Chart 3 curves Hp and H2 mark the lower and upper limits as 
regards the high-frequency consonants . In a message of 100 letters -there 
should he between 28 and 42 high-frequency consonants (DURST). -In 
Chart 4 curves Mp and Mg mark the lower and upper limits as regards the 
medium-frequency consonants. In a message of 100 letters there should be 
between 17 and 31 medium-frequency consonants (BCFGKLMPVW). 
Finally, in Chart 5, curves Lp and Lg mark the lower and upper limits as 
regards the loir-frequency consonants . In a message of 100 letters there 
should be between 0 and 3 loir-frequency consonants ( J IC Q X Z) . In using 
the charts, therefore, one finds the point of intersection of the vertical 
coordinate corresponding to the length of the message, with the horizontal 
coordinate corresponding to (l) the number of vowels, (2) the number 
of high-frequency consonants, (3) the number of medium- frequency con- 
sonants, and'(4) the number of loir-frequency consonants actually counted 
in the message. If all four points of intersection fall within the 
area delimited by the respective curves, then the numbers of vowels and 
high-, medium-, and low-frequency consonants correspond with the numbers 
theoretically expected in a normal plaintext message of the same length; 
since the message under investigation is not plain text, it follows that 
the cryptogram may certainly be classified as a transposition cipher. On 
the other hand, if one or more of these points of intersection fall out- 
side the area delimited by the respective curves, it follows that the 
cryptogram is probably a substitution cipher. The distance that the point 
of intersection falls outside the area delimited by these curves is a more 
or less rough measure of the improbability of the cryptogram's being a 
transposition cipher. 

e. Sometimes a cryptogram is encountered which is hard to classify 
with "certainty even with the foregoing aids, because it has been con- 
sciously prepared with a view to making the classification difficult. 

This can be done either by selecting peculiar words (as in “trick crypto- 
grams") or by employing a cipher alphabet in which letters of approx - 
imately similar normal frequencies have been interchanged. For example, 

E may be replaced by 0, T by R, and so on, thus yielding a cryptogram 
giving external indications of being a transposition cipher but which is 
really a substitution cipher. If the cryptogram is not too short, a close 
study will usually disclose what has been done, as well as the futility of 
so simple a subterfuge. 



I® In Charts 2-5, inclusive, the limits of the upper and lower curves 
have been calculated to include approximately 70 percent of messages of 
the various lengths^. _ 

^ The expression "the number of ... theoretically expected to appear" 
is often condensed to "the theoretical expectation of . . ." or "the normal 
e>qpectation of ..." 
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f . In. the major! by of cases, in practical work, the determination of 
the class to which a cipher of average length belongs can he made from a 
mere inspection of the message, after the cryptanalyst has acquired a 
familiarity with bhe normal appearance of transposition and of 3 ubstitu- 
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Number of letters m mossago. 

Chart 2. Curves marking the lover and upper limits of 
the theoretical amount of deviation from the number of vowels 
theoretically expected in messages of various lengths . 

(See subpar. 25d.) 



tion ciphers. In the former case, his eyes very speedily note many high- 
frequency letters, such as E, T, N, R, 0, and S, with the absence of 
low-frequency letters, such as J, K, Q, X, and Z; in the latter cose, his 
<"y-; S just as quickly note the presence of many low- frequency letters, and 
n corresponding absence of some of the high-frequency letters . 
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g. Another rather quickly completed test, in the case of the' simpler 
varieties of ciphers , is to look for repetitions of groups of letters * As 
will become apparent very soon, recurrences of syllable's, entire vprds and 
short phrases constitute a characteristic of all normal plain text;. Since 
a transposition cipher involves a change ir the sequence of the letters 
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Chart 3. Curves marking the lover and upper limits of 
the theoretical amount of deviation from the number of high- 
frequency consonants theoretically expected in messages of 
various lengths. (See subpar. 25d.) 



composing a plaintext message, such recurrences are broken up so that 
the cipher text no longer will show repetitions of more or less lengthy 
sequences of letters. But if a cipher message does shew many repetitions 
and these are of several letters in length, say over four or five, the 
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Chart h. Curves marking the lower and upper limits of 
the theoretical amount of deviation from the number of medium- 
frequency consonants theoretically expected in messages of 
various lengths. (See subpar. 25d.) 



medium-, and low-frequency consonants. Then, referring to Charts 2 to 5 , 
he should carefully note whether or not the observed frequencies for these 
categories of letters fall within the limits of the theoretical frequen- 
cies for a normal plaintext message of the same length, and be guided 
accordingly. ' 
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h. It is obvious that the foregoing rule applies only to ciphers 
composed wholly of letters- If a message is composed entirely of figures, 
or of arbitrary signs and symbols, or of intermixtures of letter^ J figures 
end other symbols, it Is immediately apparent that the cryptogram, is a 
substitution cipher. j! 
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Chart 5. Curves marking the lower and upper limits of 
the theoretical amount of deviation from the number of low- 
frequency consonants theoretically expected in messages of 
various lengths. (See subpar. 25d.) 



i. Finally, it should be mentioned that there are certain kinds of 
cryptograms whose class cannot be determined by the method set forth in 
subparagraph d above . These exceptions will be discussed in a subsequent 
section of this text .!2 



12 Section X. 
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26, Determining whether a substitution cipher is monoalphabetic or 
non-monoalphabetlc . — a. It will be remembered that a monoalphabetic 
substitution cipher is one in which a single cipher alphabet is employed 
throughout the ..iiole message; that is, a given plaintext letter is In- 
variably represented throughout the message by one and the same letter in 
the cipher text. On the other hand, a polyalphabetic substitution cipher 
is one in which two or more cipher alphabets are employed within the same 
message; that is, a given plaintext letter may be represented by two or 
more different letters In the cipher text, according to some rule govern- 
ing the selection of the equivalent to be used in each case. From this 
It follows that a single cipher letter may represent two or more different 
plaintext letters. A similar situation prevails In the case of multi- 
literal substitution, in which a particular cipher letter may constitute 
a part of the equivalents for several plaintext letters, giving rise to 
phenomena resembling those of polyalphabeticity. 

b. It is easy to see why and how the appearance of the uniliteral 
frequency distribution for a substitution cipher may be used to determine 
whether the cryptogram is monoalphabetic or non-monoalphabetic in char- 
acter. The normal distribution presents marked crests and troughs by 
virtue of two circumstances . First, the elementary sounds which the 
symbols represent are used with greatly varying frequencies. It being one 
of the striking characteristics of every alphabetic language that its 
elementary sounds are used with greatly varying frequencies.^ in the 
second place, except for orthographic aberrations peculiar to certain 
languages (conspicuously, English and French), each such sound is rep- 
resented by the same symbol. It follows, therefore, that since In a 
monoalphabetic substitution cipher each different plaintext letter 

( selementary sound) is represented by one and only one cipher letter 
(^elementary symbol), the uniliteral frequency distribution for such a 
cipher message must also exhibit the irregular crest-and-trough appearance 
of the normal distribution, but with this important modification— the 
absolute positions of the crests and troughs will not be the same as In 
the normal . That is, the letters accompanying the crests and the troughs 
in the distribution for the cryptogram will be different from those accom- 
panying the crests and the troughs in the normal distribution. But the 
marked irregularity or "roughness" of the distribution, that is, the 
presence of accentuated crests and troughs, Is in itself an indication 
that each symbol or cipher letter always represents the same plaintext 
letter in that cryptogram. Hence the general rule: A marked crest-and - 

trough appearance in the uniliteral frequency distribution for a given 
cryptogram indicates that a single cipher alphabet is involved and 
constitutes one of the tests for a monoalphabetic substitution cipher . 

c. On the other hand, suppose that In a cryptogram each cipher 
letter represents several different plaintext letters. Some of them are 
of high frequency, others of low frequency. The net result of such a 



-*-3 The student who is interested in this phase of the subject may find 
the following reference of value: Zipf G.K., Selected Studios of the 

Principle of Relative Frequency in Language , Cambridge, Mass., 1932. 
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situation, so far as the unlliteral frequency distribution for the 
cryptogram is concerned, is to prevent the appearance of any marked 
crests and troughs and to tend to reduce the elements of the distribution 
to a more or less common level. This imparts a "flattened out" appear- 
ance to the distribution. For example, in a certain cryptogram of 
polyalphabetic construction, K c =Ep, Gp, and J p ; R c =A p , Dp, and B p ; 

Xc=0p, Lp, and F p . The frequencies or Kq, Rq, and X c will be approx- 
imately equal because the summations of the frequencies of the several 
plaintext letters each of these cipher letters represents at different 
times will be about equal. If this same phenomenon were, true of all the 
letters of the cryptogram, it is clear that the frequencies of the 26 
letters, when shown by means of the ordinary uniliteral frequency dist- 
ribution, would show no striking differences and the distribution would 
have the flat appearance of a typical polyalphabetic substitution cipher. 
Hence, the general rule: The absence of marked crests and troughs in the 

uniliteral frequency distribution indicates that a complex form of sub - 
stitution is involved . The flattened-out appearance of the distribution , 
then, is one of the criteria for the rejection of a hypothesis of mono - 

alphabetic-^ substitution . 

d. The foregoing test based upon the appearance of the frequency 
distribution is only one of several means of determing whether a sub- 
stitution cipher is monoalphabetic or non-monoalphabetic in composition. 

It can be employed in cases yielding frequency distributions from which 
definite conclusions can be drawn with more or less certainty by mere 
ocular examination. In those cases in which the frequency distributions 
contain insufficient data to permit drawing definite conclusions by such 
examination, certain statistical tests can be applied. One of these 
tests, called the <J> (phi) test, warrants detailed treatment and is 
discussed in paragraph 27 below. 

e. At this point, however, one additional test will be given be- 
cause of its simplicity of application. This test, the A (lambda) or 
blank-expectation test , may be employed in testing messages up to 200 
letters in length, it being assumed that in messages of greater length 
ocular examination of the frequency distribution offers little or no 
difficulty. This test concerns the number of blanks in the frequency 
distribution, that is, the number of letters of the alphabet which are 
entirely absent from the message. It has been found from statistical 
studies that rather definite "laws" govern the theoretically expected 
number of blanks in normal plaintext messages and in frequency distribu- 
tions for cryptograms of different natures and of various sizes. The 
results of certain of these studijs have been embodied in Chart 6. 

f. This chart contains two curves. The one labeled P applies to the 
average number of blanks theoretically expected in frequency distributions 
based upon normal plaintext messages of the indicated lengths . The other 
curve, labeled R, applies to the average number of blanks theoretically 
expected in frequency distributions based upon perfectly random assort- 
ments of letters; that is, assortments such as would be found by random 



l^CfTT footnote 8 on page 40 . 
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selection of letters out of a hat containing thousands of letters, all of 
the 2 6 letters of the alphabet being present in equal proportions, each 
letter being replaced after a record of its selection has been made. Such 
random assortments correspond to polyalphabetic cipher messages in which 
the number of cipher alphabets is so large that if uniliteral frequency 
distributions are made of the letters, the distributions are practically 
identical with those which are obtained by random selections of letters 
out of a hat. 
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Chart 6. Curves showing the average number of blanks 
theoretically expected in distributions for plain text (P) 
and for random text (R) for messages of various lengths. 
(See subpar. 26f.) 



, g. In using this chart, one finds the point of intersection of the 
vertical coordinate corresponding to the length of the message, with the 
horizontal coordinate corresponding to the observed number of blanks in 
the distribution for the message. If this point of intersection falls 
closer to curve P than it does to curve R, the number of blanks in the 
message approximates or corresponds more”closely to the number theoreti- 
cally expected in a plaintext message than it does to a random (ciphertext) 
message of the same length} therefore, this is evidence that the crypto- 
• gram is monoalphabet ic . Conversely, if this point of intersection falls 
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closer to curve R than to curve P, the number of blanks in the message ap- 
proximates or corresponds more closely to the number theoretically expected 
in a random text than it does to a plaintext message of the same length; 
therefore, this is evidence that the cryptogram is non-monoalphabetic . 

27. The <f> (phi) test for determining monoalphabeticlty . — a. The 
student has seen in the preceding paragraph how it is possible to deter- 
mine by ocular examination whether or not a substitution cipher is mono- 
alphabetic. This tentative determination is based on the presence. of a 
marked crest -and -trough appearance in the uniliteral frequency distribu- 
tion, and also on the number of blanks in the distribution. However, when 
the distribution contains a small number of elements, ocular examination 
and evaluation becomes increasingly difficult and uncertain. In such 
cases, recourse may be had to a mathematical test, known as the <f> test, 
to determine the relative monoalphabeticlty or non-monoalphabetic ity of 
a distribution. 

b. Without going into the theory of probability at this time, or 
into the derivation of the formulas involved, let it suffice for the pre- 
sent to state that with this test the "observed value of (j>" (symbolized 
by 4>o) is compared with the "expected value of <j> random’ 1 (cj> r ) and the 
’’expected value of <j> plain" (<£p) . The formulas are (j> r s.0385N(lI-l) and, 
for English military text, <j>pr .o667W(N-l) , where N is the total number of 
elements in the distribution. 15 The use of these formulas is best illus- 
trated by an example, 

c. The following short cryptogram with its accompanying uniliteral 
frequency distribution is at hand: 

QCYCH ADSKS YZZQE CYKYK QZYSK 

LSZAC TKFCX LKLKC ESZMX KISZX 



= _ __ g = _ = ^=50 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 



15 The constant .0385 is the decimal equivalent of 1 / 2 6 , i.e., the 
reciprocal of the number of elements in the alphabet. The constant .0667 
is the sum of the squares of the probabilities of occurrence of the indi- 
vidual letters in English plain text. These constants are treated in 
detail in Military Cryptanalysis, Part II. 
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<f>Q is calculated "by applying the formula f(f-l) to the frequency (f) of 
each letter and' totaling the result j or, expressed in mathematical 
notation, <j> 0 = (f-l) . Thus, 

£f = 2 6121 11 831 3 61 3 5 6 = 50 

ABCDEFGHIJKLMROPQRSTUVWXYZ 
£f(f-l) = 2 300 20 00 56 6 0 6 30 0 6 2030= 188 



For this distribution, <j>r= .0385N(K-l) = .0385 x 50 x = 9k, and 

<J>p= .O667H (N-l) = ,0667 x 50 x k 9 = 163. 

Row since <j> 0 , 188, is in fact greater than 4>p, we have a mathematical 
corroboration of the hypothesis that the cryptogram is a monoalphabetic 
substitution cipher. If <j> 0 were nearer to <j) r , then the assumption would 
be that the cryptogram is not a monoalphabetic cipher. If were just 
half way between 4> r and cj>p, then decision would have to be suspended, 
since no further statistical proof in the matter is possible with this 
particular test .IT 

d. 1 Two further examples may be illustrated: 



(l) ABCDEFGHIJKLMROPQRSTUVWXYZ ? 

0 026122 0 3220 0 6 *f (f-l) =42 



' 1 ^ The more usual mathematical notation for expressing <j> 0 would be 
Z 

21 fi(fi-l), which is read as "the sum of all the terms for all integral 
1 =A Z 

values of f from A to Z inclusive . In turn, 2 ! fi(fi-l) would be expanded 

i=A 

as fA(fA~l)* fB(ik-l) ’►fcUc-l) 4 ** • • • .tfz(fZ-l) • However, in the 
interest of simplicity the notation £ f( f-l) is employedj likewise, the 
notations and <j> p are eusployed in lieu of the more usual E(<j>r) and E(<j>p), 



■*-T Another method of determining the relative monoalphabetic ity of a 
cryptogram is based upon comparing the index of coincidence (abbr. I.C .) 
of the cryptogram under examination with the theoretical I.C. of plain 
text. The I.C. of „a message is defined as the ratio of $ 0 to fa.? thus, 



in the example above, the I.C. is 



188 

-9H’ 



which equals 2 . The theoretical 



I.C. of English plain text is 1 . 73 , which is the decimal equivalent of 

the ratio of the "plain constant" to the "random constant". The 
.0385 ■* 

I.C* of random text is 1, i.e., . 038 ? . 

703*55 
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_ _ _ = _ _ = __ _ s Nc2 5 

( 2 ) ABCDEFGHIJKLMNOPQRSTUVWXYZ 

0 0 0 2 0 0 0.6 0 0 02 00 0 0*2 6 £f(f-l)=l8 

Since "both distributions have 25 elements, then for both 

4>r * .0385 x 25 x 24 = 21, and ” 

4 >p = .0667 x 25 x 2 k = 40 . 

Hence distribution (l) is monoalphabet ic , while (2) is not* 

e. The student must not assume that statistical tests in cryptanal- 
ysis are infallible or absolute in themselves 1 ^, statistical approaches 
serve only as a means to the end, in guiding the analyst to the most 
probably fruitful sources of attack. Since no one test in cryptanalysis 
gives definite proof of a hypothesis (in fact, not even a battery of tests 
gives absolute proof), all applicable statistical means at the disposal 
of the cryptanalyst should be used; thus, in examination for monoalphabet- 
icity, the <j> test, A test, and even other testsi 9 could profitably be 
employed. To illustrate this point, if the <j> test is taken on the 
distribution of the plaintext letters of .the phrase 

A QUICK BROWN FOX JUMPS OVER THE LAZY DOG 



~ = = N=33 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 
2 2 12 2 2 £f(f-l)«20 

^r = ^lj <j)p = 70 

it will be noticed that <j> 0 is less than half of 4 r> thus conclusively 
"proving" that the letters of this phrase could not possibly constitute 
plain text nor a monoalphabetic encipherment of plain text in any lan- 
guage! The student should be able to understand the cause of this 
cryptologic curiosity. 

18 The following quotation from the Indian mathematician P. C, Maha- 

lanobis, concerning the fallibility of statistics, is particularly 
appropriate in this connection: "If statistical theory is right, predic- 

tions must sometimes come out wrong; on the other hand, if predictions 
are always right, then the statistical theory must be wrong." — Sankhya, 
Vol. 10 , Part 3 , p. 203 . Calcutta, 1950 . 

1 9 One of these, the chi-square test , will be treated in a subsequent 
text. 
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28. Determining -whether a cipher alphabet is standard (direct or 
reversed) or mixed . --'a. Assuming that the uniliteral frequency distri- 
bution for a given cryptogram has been made, and that it shows clearly 
that the cryptogram is a substitution cipher and is monoalphabetic in 
character, a consideration of the nature of standard cipher alphabets^ 
almost makes it obvious how an inspection of the distribution will dis- 
close whether the cipher alphabet involved is a standard cipher alphabet 
or a mixed cipher alphabet. If the crests and troughs of the distribu- 
tion occupy positions which correspond to the relative positions they 
occupy in the normal frequency distribution, then the cipher alphabet 
is a standard cipher alphabet. If this is not the case, then it is 
highly probable that the cryptogram has been prepared by the use of a 
mixed cipher alphabet. A mechanical test maybe applied in doubtful 
cases arising from lack of material available for study; just what this 
test involves, and an illustration of its application will be given in 
the next section, using specific examples. 

b. Of course, if it has been determined that a standard cipher al- 
phabet is involved in a particular instance, it goes without saying that 
at the same time it must have been found whether the alphabet is a direct 
standard or reversed standard cipher alphabet. The difference between the 
distribution of a direct standard alphabet cipher and one of a reversed 
standard alphabet cipher is merely a matter of the direction in which the 
sequence of crests and troughs progresses — to the right, as is done in 
normally reading or writing the alphabet (ABC rft Z), or to the left, 
that is, in the reversed direction (Z .f. C B A). With a direct standard 
cipher alphabet the direction in which the crests and troughs of the dis- 
tribution progress is the normal direction, from left to right; with a 
reversed standard cipher alphabet this direction is reversed, from right 
to left. 



See par. 12. 
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SECTION V 

UNILITERAL SUBSTITUTION WITH STANDARD CIPHER ALPHABETS 



Paragraph 

Types of standard cipher alphabets •*•••••«•••• 29 

Procedure in enclphermsnt and decipherment by means of 

unlliteral substitution. 30 

Principles of solution by construction and analysis of the 

uniliteral frequency distribution............*..* 31 

Theoretical example of solution. ...... 32 

Practical example of solution by the frequency method........ 33 

Solution by completing the plain-component sequence................. 3 ^ 

Special remarks on the method of solution by completing the 

plain-component sequence 39 

Value of mechanical solution as a short cut..... 36 

Basic reason for the low degree of cryptosecurity afforded by 

monoalphabetic cryptograms involving standard cipher alphabets... 37 

29 . Types of standard cipher alphabets .—a. Standard cipher 
alphabets are of two types: 

(1) Direct standard , in which the cipher component is the normal 
sequence but shifted to the right or left of its point of coincidence in 
the norma], alphabet. Example: 

— 

Plain: ABCIEFGHIJK 1 MI 0 PQPSTUVWXYZ 

Cipher: QRSTUVVrXYZABCDEFGHIJKLMNOP 

* 

It Is obvious that the cipher component can be applied to the plain 
component at any one of 26 points of coincidence, but since the alphabet 
that results from one of these applications coincides exactly with the 
normal alphabet, a series of only 25 (direct standard) cipher alphabets 
results from the shifting of the cipher component, 

(2) Reversed standard , in which the cipher component is also the 
normal sequence but runs in the opposite direction from the normal. 
Example: 

> 

Plain: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher: QPOIMtKJIHGFEDCBAZmmJTSR 

< 

Here the cipher component can be applied to the plain component at any 
of 26 points of coincidence, each yielding a different cipher alphabet. 
There is inthis case, therefore, a series of 2 6 (reversed standard) 
cipher alphabets. 
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b. It is often convenient to refer to or designate one of a series 
of cipher alphabets without ambiguity or circumlocution. The usual me- 
thod is to indicate the particular alphabet to which reference is made by 
citing a pair of equivalents in that alphabet, such as, in the example 
above, ApsQ c * The key for the cipher alphabet just referred to, as 

well as that preceding it, is ApzQc, and it is said that the key letter 
for the cipher alphabet is Qq. 

c. The cipher alphabet in subpar. a(2), above, is also a reciprocal 
alphabet ; that is, the cipher alphabet contains 13 distinct pairs of equiv- 
alents which are reversible. For example, in the alphabet referred to, 
ApsQc and Qp=Ac; BpsP c and Pp=B c , etc. The reciprocity exists through- 
out the alphabet and is a result of the method by which it was formed. 
(Reciprocal alphabets may be produced by juxtaposing any two components 
which are identical but progress in opposite direction's.’) 

30. Procedure in encipherment and decipherment by means of uni - 
literal substitution. — a. When a message is enciphered by means of 
unlllteral substitution , or simple substitution (as it is often called), 
the individual letters of the message text are replaced by the single- 
letter equivalents taken from the cipher alphabet selected by prearrange- 
ment. Example: 

Message: EIGHTEEN PRISONERS CAFTURFD 

Enciphering alphabet: Direct standard, Ap=T c 

Plain: ABCDEFGHIJKLMNOPQEISTUV^'TXXZ 

Cipher: TUVWXYZABCDEFGHIJKLMNOPQRS 

Letter-for-letter encipherment: 

EIGHTEEN PRISONERS CAPTURED 
XBZAMXXG IKBLI3GXKL VTIMNKXW 

The cipher text 1 b then regrouped, for transmission, into groups of five. 
Cryptogram: 

XBZAM XXGIK BLHGX KLVTI MNKXW 

b. The procedure in decipherment is merely the reverse of that in 
encipherment . The cipher alphabet selected by prearrangemenb is set up 
with the cipher component arranged in the normal sequence and placed above 
the plain component for ease in deciphering. The letters of the crypto- 
gram are then replaced by their plaintext equivalents, as shown below. 

Cipher: ABCDEFGHIJKUINOPQPSTUVWXYZ 

Plain: HIJmnTOPQRSTUVWXIZABCnEFG 

The message deciphers thus: 

Cipher: XBZAM XXGIK BLHGX KLVTI MNKXW 

Plain: EIGHT EENFR ISONE RSCAP TUBED 

The deciphering clerk rewrites the text in word lengths: 

EIGHTEEN PRISONERS CAPTURED 
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c, In subpar, a, above, the ci yptogram was prepared In final form 
for transmission by dividing the cryptographic text into groups of five* 
This is generally the case in military communications Involving cipher 
systems. It promotes enc' racy in telegraphic transmission since an opera- 
tor knows he must receive <=, definite number of characters in each group, 
no more and no less. Also, x*c /ually makes solution of the messages by 
unauthorized persons more difficult because the length of the words, 
phrases, and sentences of the plain text is hidden. If the last group of 
the cipher text in subpar . 30 a had not been a complete group of five 
letters, it might have been completed by adding a sufficient number of 
meaningless letters (c tilled nulls ) . 

31. Principles of solution by construction and analysis of the 
unlliteral frequency distribution .— a. The analysis of monoalphabetlc 
cryptograms prepared by the use of standard cipher alphabets follows al- 
most directly from a consideration of the liature of such alphabets. Since 
the cipher component of a standard cipher alphabet consists either of the 
normal sequence merely displaced 1, 2, 3* • « • Intervals from the normal 
point of coincidence, or of the normal sequence proceeding in a reversed- 
normal direction, it is obvious that the uniliteral frequency distribution 
for a cryptogram prepared by means of such a cipher alphabet employed 
monoalphabetically will show crests and troughs whose relative positions 
and frequencies will be exactly the same as in the uniliteral frequency 
distribution for the plain text of that cryptogram. The only thing that 
ha3 happened is that the whole set of crests and troughs of the distribu- 
tion has been displaced to the right or left of the position it occupies 
in the distribution for the plain text; or else the successive elements of 
the whole set progress in the opposite direction. .Hence,- it follows that ’ 
the correct determination of the plaintext value of the cipher letter 
marking any crest or trough of the uniliteral frequency distribution, 
coupled with the correct determination of the relative direction in which 
the plain component sequence progresses, will result at one stroke in the 
correct determination of the plaintext values of all the remaining 25 
letters respectively raarking the other crests and troughs in that dis- 
tribution. The problem thus resolves itself into a matter of selecting 
that point of attack which, will most quickly or most easily lead to the 
determination of the value of one cipher letter. The single word 
ident if ic at ion will hereafter be used for the phrase "determination of 
the value of a cipher letter"; to identify a cipher letter is to find 
its plaintext value. 

b. It is obvious that the easiest point of attack is to assume that 
the letter marking the crest of greatest frequency in the frequency 
distribution for the cryptogram represents Ep* Proceeding from this 
initial point, the identifications of the remaining cipher letters marking 
the other crests and troughs are tentatively made on the basis that the 
letters of the cipher component proceed in accordance with the normal 
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alphabetic sequence, either direct or reversed. If the actual frequency 
of each letter marking a crest or a trough approximates to a fairly close 
degree the normal or theoretical frequency of the assumed plaintext 
equivalent, then the initial identification ©c=Ep may he assumed to he 
correct and therefore the derived identifications of the other cipher 
letters also may he assumed to he correct A If the original starting 
point for assignment of plaintext values is not correct, or if the direc- 
tion of “reading" the successive crests and troughs of the distribution 
is not correct, then the frequencies of the other 25 cipher letters will 
not correspond to or even approximate the normal or theoretical frequencies 
of their hypothetical plaintext equivalents on the basis of the initial 
identification. A new initial point, that is, a different cipher equiv- 
alent, must then he selected to represent Ejj; or else the direction of 
“reading" the crests and troughs must he reversed. This procedure, that 
is, the attempt to make the actual frequency relations exhibited by the 
uniliteral frequency distribution for a given cryptogram conform to the 
theoretical frequency relations of the normal frequency distribution in a a 
effort to solve the cryptogram, is referred to technically as "fitting the 
actual uniliteral frequency distribution for a cryptogram to the theoret- 
ical uniliteral frequency distribution for normal plain text", or, more 
briefly, as “fitting the frequency distribution for the c r yptogram to the 
normal frequency distribution ' 1 or,' still more briefly, ' fitting the dis- 
tribution to the normal ." In statistical work the expression commonly 
employed in connection with this process of fitting an actual distribu- 
tion to a theoretical one is "testing the goodness of fit." The goodness 
of fit may be stated in various ways, mathematical in character. 2 

£. In fitting the actual distribution to the normal, it is neces- 
sary to regard the cipher comporent (that is, the letters A . . , Z marking 
the successive crests and troughs of the distribution) as partaking of 
the nature of a circle, that is, a sequence closing in upon itself, so that 
no matter with what crest or trough one starts, the spatial and frequency 
relations of the crests and troughs are constant. This manner of regard- 
ing the cipher component as being cyclic in nature is valid because it 
is obvious that the relative positions and frequencies of the crests and 
troughs of any uniliteral frequency distribution must remain the same 
regardless of what letter is employed as the initial point of the distri - 
bution . Fig. 5 gives a clear picture of what is meant in this connect- 
ion, as applied to the normal frequency distribution. 



The Greek letter 0 (theta) is used to represent a character or 
letter without indicating its identity. Thus, instead of the circum- 
locution "any letter of the plain text", the symbol 8p is used; and for 
the expression "any letter of the cipher text", the symbol 0 C is used. 

2 One of these tests for expressing the goodness of fit, the % (chi) 
test, will he treated in Military Cryptanalysis, Part II. 
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Figure 5* 



d. Ia the third sentence of subparagraph b, the phrase "assumed to 
he correct" was advisedly employed in describing the results of the at- 
tempt to fit the distribution to the normal, because the final test of 
the goodness of fit in this connection (that is, of the correctness of 
the assignment of values to the crests and troughs of the distribution) 
is whether the consistent substitution of the plaintext values of the ci- 
pher characters in the cryptogram will yield intelligible plain text. If 
this is not the case, then no matter how close the approximation between 
actual and theoretical frequencies is, no matter how well the actual fre- 
quency distribution fits the normal, the only possible inferences are 
that (l) either the closeness of the fit is a pure coincidence in this 
case and that another equally good fit may be obtained from the same data, 
or else (2) the cryptogram involves something more than simple monoalpha- 
betic substitution by means of a single standard cipher alphabet* For 
example, suppose a transposition has been applied in addition to the 
substitution, Then, although an excellent correspondence between the 
unlliteral frequency distribution and the normal frequency distribution 
has been obtained, the substitution of the cipher letters by their assumed 
equivalents will still not yield plain text. However, aside from such 
cases of double encipherment, instances in which the uniliteral frequency 
distribution may be easily fitted to the normal frequency distribution 
and in which at the same time an attempted simple substitution fails to 
yield intelligible text are rare. It may be said that. In practical 
operations whenever the uniliteral frequency distribution can he made to 
fit the normal frequency distribution, substitution of values will result 
in solution; and, as a corollary, whenever the uniliteral frequency 
distribution cannot be made to fit the normal frequency distribution, 
the cryptogram does not represent a case of simple, monoalphabetic 
substitution by means of a standard alphabet. 
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32 « Theoretical example of solution *-- a« The foregoing principles 
will become clearer by noting the encryption and solution of a theoretical 
example. The following message is to be encrypted. 

HOSTILE FORCE ESTIMATED AT ONE REGIMENT BfFANTRY AND TWO PLATOONS 
CAVALRY MOVING SOUTH ON QUINN IMONT PIKE STOP HEAD OF COLUMN NEARING ROAD 
JUNCTION SEVEN THREE SEVEN COMMA EAST OF GREENACRE SCHOOL FIRED UPON BY 
OUR PATROIS STOP HAVE DESTROYED BRIDGE OVER INDIAN CREEK. 

b. First, solely for purposes of demonstrating certain principles, 
the unillteral frequency distribution for this plaintext message is 
presented in Figure 6. 




ABCDEFGHIJKLMNOPQRSTUVWXYZ 



Figure 6. 



a. Now let the foregoing message be encrypted monoalphabetically by 
the following standard cipher alphabet, yielding the cryptogram shorn 
below and the frequency distribution shovn in Figure 7* 



Plain -ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher - - - GHIJKLMNOPQRSTUVWXYZABCDEF 

Plain - - - HOSTI LEFOR CEEST BATE DATON EREGI MEBTI NFANT RYAND 

Cipher - - NUYZO RKLUX IKKYZ OSGZK JGZUT KXKMO SKTZO TLGTZ XEGTJ 

Plain TWOPL ATOON SCAVA LRYMO VMGS OUTHO NQUIN NBION TP IKE 

Cipher - - ZCUVR GZUUT YIGBG HXESU BCTMY HAZNU TWAOT TOSUT ZVOQK 

Plain STOPH EADOF COLUM NIIEAR IHGRO ADJCJN CTION SEVEN THREE 

Cipher - - YZUVN KGJUL IURAS TTKGX OTMXU GJPAT IZOUT YKBKT ZNXKK 

Plain - - - SEVEN COMMA EASTO FGREE NACRE SCHOO IFIRE DUPON BYOUR 

Cipher - - YKBKT IUSSG KGYZU BMXKK TGIXK YINUU RLOXK JAVUT HEUAX 

Plain PATRO LSSTO PHAVE DESTR OYEDB RIDGE OVERI NDIAN CREEK 

Cipher - - VGZXU RYYZU VNGBK JKYZX UEKJH XOJMK UBKXO TJOGT KKKQ 



Cryptogram 



NUYZO 
SKTZO 
R X E S U 
YZUVN 
IZOUT 
LMXKK 
VGZXU 
U B K X 0 



RKLUX 
TLGTZ 
BOTKY 
K G J U L 
YKBKT 
TGIXK 
RYYZU 
TJOGT 



IKKYZ 
XEGTJ 
U A Z N U 
IURAS 
ZNXKK 
YINUU 
VNGBK 
IXKKQ 



OSGZK 
ZCUVR 
T W A 0 T 
TTKGX 
YKBKT 
RLOXK 
JKYZX 



JGZUT 

GZUUT 

TOSUT 

OTMXU 

IUSSG 

JAVUT 

UEKJH 



KXKMO 

YIGBG 

ZVOQK 

GJPAT 

KGYZU 

HEUAX 

XOJMK 
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DEFGHIJKLMNOPQRSTUVWXYZ 



Figure 7« 



d. Let the student now compare Figs* 6 and 7# which, have teen su- 
perimposed in Fig* 8 for convenience in examination. Crests and troughs 
are present in both distributions; moreover their relative positions and 
frequencies have not been changed in the slightest particular. Only the 
absolute position of the sequence as a whole has been displaced six 
places to the right in Fig. 7, as compared with the absolute position of 
the sequence in Fig. 6, 



g g § gg g g 

g 5 § g ~.g ^1?. g g g _ 

3 Sr 55 3 3 S5 ^ 55 5E 55 5 

ABCDEFGHIJKLMN OPQRS TU VWXYZ 

t 

0*1 « Xi3 ^ ^ 

i g = g 

* g ___ g g ^ | 

g =£ § g ^ ! ^ = ggjggg 

g g - § g = g g g g g g g ^ =5 g g g g g » g g g 

ABCDEFGHIJ KLMNOPQRSTUVWXYZ 

Figure 8. 



e. If the two distributions are compared in detail the student will 
clearly understand how easy the solution of the cryptogram would be to one 
who knew nothing about how it was prepared. For example, the frequency of 
the highest crest, representing Ep in Fig, 6 is 28; at an interval of four 
letters before Ep there is another crest representing Ap with frequency 
l6. Between A and E there is a trough, representing the medium-frequency 
letters B, C, D. On the other side of E, at an interval of four letters, 
comes another crest, representing I with frequency 1^. Between E and I 
there is another trough, representing the medium- frequency letters F, G, II. 
Compare these crests and troughs with their homologous crests and troughs 
in Fig. 7* In the latter, the letter K marks the highest crest in the 
distribution with a frequency of 28; four letters before K there is an- 
other crest, frequency l6, and four letters on the other side of K there 
is another crest, frequency l4. Troughs corresponding to B, C, D and F, 

G, H are seen at H, I, J and L, M, H in Fig, 7* In fact, the two dis- 
tributions may be made to coincide exactly, by shifting the frequency 
distribution for the cryptogram six places to the left with respect to 
the distribution for the equivalent plaintext message, as shown herewith. 
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a ^ g g g ^ ^ 

I g g § gg g =5 g 

6 g s § j ^g g|||| ISlig = 

GHIJKLMNOPQRSTUVWXYZABCDEF 

Figure 9» 

f . Let us suppose now that nothing is known about the process -of 
encryption, and that only the cryptogram and its uniliteral frequency- 
distribution is at hand. It is clear that simply bearing in mind the 
spatial relations of the crests and troughs in a normal frequency dis- 
tribution would enable the cryptanalyst to fit the distribution to the 
normal in this case. He would naturally first assume that K c sEp, from 
which it would follow that if a direct standard alphabet is involved, 
L c sFp,Mc=Gp, and so on, yielding the following (tentative) deciphering 
alphabet: 



Cipher ABCDEPGEI JK1MH0PQRSTUVUXYZ 

Plain UVWXYZABCDEFGHIJKLMHOFQRST 

* < 

• | i 

jg. How comes the final tesui tf -these assumed values are substi- 
tuted in the cipher text, the plain texfc immediately appears. Thus: 

NUYZO RKLUX IKKYZ OSGZIC JGZUT etc. 

HOSTI LEFOR CEEST I MATE DATOH etc. 

h. It should be clear, therefore, Lhat the initial selection of 
Gc as the specific key (that is, to represent Ap) in the process of 
encryption has absolutely no effect upon the relative spatial and 
frequency relations of the crests and troughs of the frequency distri- 
bution for the cryptogram. If Qc had been selected to represent Ap, 
these relations would still remain the same, the whole series of crests 
and troughs being merely displaced further to the right of the positions 
they occupy when G c =Ap. 

33. Practical example of solution by the frequency method , ~ 
a. The case of direct standard alphabet ciphers . - - (l) The following 
cryptogram is to be solved by applying the foregoing principles: 



IVHVH 

W B N J B 



C A X X Y 
C X 0 W N 



B J C C J 
F C X W B 



L T R W P 
C X Y Y H 



X D A Y X 
C II A B L 



B R C R X 
X U R U 0 
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(2) From the presence of so mar../ low-frequency letters such as B, 

W , and X it is at once suspected that this is a substitution cipher. But 
to illustrate the steps, that must he taken in difficult cases in order to 
he certain in this respect, a uniliteral frequency distribution is con- 
structed, and then reference is r^fte to Charts 2 to 5 to note whether the 
actual numbers of vowels, high-, meujLu__-, and low-frequency consonants 
fall inside or outside the areas S-' limited by the respective curves. 



_ _ b = = 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

r»» 

Figure 10 a. 




Vowels (A E I 0 U Y) 

High-frequency Consonants (D N R S T) 

Medium-frequency Consonants (B C F G H L M P V W) 
Low-frequency Consonants i( J K Q X Z) 



Total 



w , Position with respect to areas 

Frequency de i im i te d by curves 



Outside, chart 1. 
Outside, chart 2. 
Outside, chart 3. 
Outside, chart 4. 




(3) All four points falling completely outside the areas delimited by 
the curves applicable to these four classes of letters, the cryptogram is 
clearly a substitution cipher. 

(4) The appearance of the frequency distribution, with marked crests 
and troughs, indicates that the cryptogram is probably monoalphabetic . At 
this point the <f> test is applied to the distribution. The observed value 
of <p is found Hbo be 258, while the expected value of <J> plain and <f) random 
are calculated to be 23© and 136, respectively. The fact that the ob- 
served value is not only closer to but greater than <f>p is taken as 
statistical evidence that the cryptogram is monoalphabetic. Furthermore, 
reference being made to Chart 6, the point of intersection of the message 
length (60 letters) and the number of blanks (8) falls directly on 

curve P; this is additional evidence that the message is probably mono- 
alphabetic . 

(5) The next step is to dete rmin e whether a standard or a mixed 
cipher alphabet is involved. This is done by studying the positions and 
the sequence of crests and troughs in the frequency distribution, and 
trying to fit the distribution to the normal. 
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(6) The first assumption to be made is that a direct standard cipher 
alphabet is involved. The highest crest in the distribution occurs over 
Let it be assumed that Then Yq^ Zq, Aq, .... "Pp^ Gp, 

Hp, . . . ., respectively; thus: “ 



_ _ £ = S 

Cipher. ...A BCDEFGHIJKLMNOPQRSTUV¥XYZ 
Plain.. ...H IJKLMNOPQRSTUVWXYZABCDEFG 

Figure 10b. 

It may be seen quickly that the approximation to the expected frequencies 
is very poor. There are too many occurrences of J p , Qp, U p and Fp and 
too few occurrences of Up, Op, R p , S p , T p and A p . Moreover, if a substi- 
tution is attempted on this basis, the following is obtained for the first 
two cipher groups: 

Cipher N ¥NVHCAXXY 

“Plain texfc“U DUCO JHEEF 

This is certainly not plain text and it seems clear that X c is not E_, if 
the hypothesis of a direct standard alphabet cipher is correct. A p 
different assumption will have to be made. 

(7) Suppose Cc=Ep. Going through the same steps as before, again 
no satisfactory results are obtained. Further trials3 are made along the 
same lines, until the assumption N c eEp is tested: 



= 22 - _ _ = « ^ = _ = = 

Cipher. ...A BCDEFGHIJKLMNOPQRSTUV¥XYZ 
Plain RSTTJVWXYZABCDEFGHIJKLMHOPQ 

r 

Figure 10c. 

(8) The fit in this case is quite good; possibly there are too few 
occurrences of Ap, Bp, and Rp. But the final test remains: trial of the 
substitution alphabet on the cryptogram itself. This is done and the 
results are as follows: 



C: 


N ¥ N V H 


C A X X Y 


B J C C J 


L T R ¥ P 


X D A Y X 


BRC 


R 


X 


F: 


ENEMY 


TROOP 


S A T I A 


C K I N G 


0 U R P 0 


SIT 


I 


0 


C: 


W B N J B 


C X 0 W N 


F C X W B 


C X Y Y N 


C N A B L 


X U R 


¥ 


0 


P: 


N S E A S 


T 0 F N E 


¥ T 0 N S 


T 0 P P E 


T E R S C 


0 L I 


N 


F 



ENEMY TROOPS ATTACKING OUR POSITIONS EAST OF NEWTON. PETERS COL INF. 



3 It is unnecessary, of course, to write out all the alphabets and 
pseudo-decipherments, as shown above, vhen testing assumptions. This is 
usually done mentally. 
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(9) It is always advisable to note the specific key. In this case 
the correspondence between any plaintext letter and. its cipher equivalent 
will indicate the key. Although other conventions are possible, end 
equally valid," it is usual, however, to Indicate the key by noting the 
cipher equivalent of Ap. In this case Ap=J c « 

b, The case of reversed standard alphabet ciphers . -~(l) Let the 
following cryptogram and, its frequency distribution he studied, 

F W F X L QSVVU R J Q Q J H Z B W D V P S U V RBQBV 

WRFJR QVEWF MQVHR QVUUF QFSRH VYBWE 

(2) The preliminary steps illustrated above, under subpar „ a (l) 

to (4) inclusive, in connection with the test for class and monoalphabot- 
icity, will here he omitted, since they are exactly the same In nature • 
The result is that the cryptogram is obviously a substitution cipher and 
is monoalphabetic. 

(3) Assuming that it is not known whether a direct or a reversed 
standard alphabet is involved, attempts are at once made to fit the fre- 
quency distribution to the normal direct sequence • If the student vill 
try them he will soon find out that these are unsuccessful. All this 
takes but a few minutes. 

(4) The next logical assumption is near m 3 .de, viz ., that the cipher 
alphabet is a reversed standard alphabet. When on this basis F c is 
assumed to be Ep, the distribution con readily be fitted to the normal, 
practically every crest and trough in the actual distribution correspond- 
ing to a crest or trough in the expected distribution. 



s 

cipher*. ..A BCDEFGHIJKLMNOPQRSTUVWXYZ 
Plain JIHGFEDCBAZYXWVUTSRQPONMLK 

Figure lOd. 

( 5 ) When the substitution is made in the cryptogram, tlxe following 
is obtained. 

Cryptogram. ..F W.FXL QSVVU RJQQJ 

Plain text. ..E HEMY TROOP SATTA 

( 6 ) The plaintext; massage is identical with that in subpar . a. The 
specific key in this case is also ApCJ c . If the student vill compare the 
frequency distributions in the t^ro cases, lie will note that the relative 
positions and extents of the crests and troughs are identical; they 
merely progress in opposite directions . 
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c. General note on solution by the frequency method .-- >In actual 
practice, the procedure of subpars . a and b are given a more rapid treat- 
ment than that just described, the practical treatment being based, not 
on the initial finding of some single crest or trough, but rather on locat- 
ing the more readily-discernlble clusters of crests which usually appear 
in a distributio n, su ch as the distinctive crest-patterns representing 
"A. . .E» . .I" and "RST", These crest-patterns are searched for, with a 
quick scanning of the distribution, and then the relative placement with 
respect to each other is tested to see if it conforms to the expectation 
for a direct standard cipher alphabet, and, if not, then for a reversed 
standard cipher alphabet. During this latter step, which consists "of 
little more than counting in one direction and then (when necessary) in 
the other, the blank (or near ly-bla nk) expectation of "uK" followed by 
the characteristic curve for "IMTUF" and the blank "Q" are considered, 
as a means of either substantiating or invalidating the original 
"identification” of the crests. 



3^. Solution by completing the plain-component sequence . — 
a. The case of direeb standard alphabet cipher s.— (l) The foregoing 
method of analysis, involving as it does the cons trur+.ion of a uniliteral 
frequency distribution, vas termed a solution by the frequency method 
because it involves the construction of a frequency distribution and its 
study. There is, however, another method which is much more rapid, almost 
wholly mechanical, and which, moreover, does not necessitate the con- 
struction or study of any frequency distribution whatever. An under- 
standing of the method follows from a consideration of the method of 
encipherment of a message by the use of a single, direct standard cipher 
alphabet. 

( 2 ) Note the following encipherment: 

Message TWO CRUISERS SUNK 

Enciphering Alphabet 

ABCDBFGHIJKLMH05QRSTUVWXYZ 
GHIJKLMHOPQRSTUVWXYZABCDEF 



Plain — - 
Cipher — 



Encipherment 



Plain text— TWO 

Cryptogram Z C U 

Cryptogram 



CRUISERS 

IXAOYKXY 



SUITE 
Y A T Q 



ZCUIX AOYKX YYATQ 

(3) The enciphering alphabet sho'm above represents a case wherein 
the sequence of Hesters of both components of the cipher alphtbet is the 
normal sequence, with the sequence forming the cipher component merely 
shifted six places to the left (or 20 p.siticr.s to tlie right) of the 
position it occupies in the normal alphabet. If, therefore, two r, trips 
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of paper bearing the letters of the normal sequence, equally spaced, are 
regarded aa the two components of the cipher alphabet and are juxtaposed 
at all of the 25 possible points of coincidence, it is obvious that one 
of those 25 juxtapositions must correspond to the. actual juxtaposition 
shown in the enciphering alphabc' .directly above . It is equally obvious 
that if a record were kept of the results obtained by applying the values 
given at each juxtaposition to the letters of the cryptogram, one of these 
results would yield the plain text of the cryptogram. 



(4) Let the work be* systematized and the results set down in an 
orderly manner for examination. It is obviously unnecessary to juxtapose 
the two components so that Ac=Ap, for on the assumption of a direct stan- 
dard alphabet, juxtaposing two direct normal components at their normal 
point of coincidence merely yields .plain text. The next possible juxta- 
position, therefore, is Ac=Bp. Let the juxtaposition of the two sliding 
strips therefore be A c =Bp, as shown here: 



Plain ABCDEPGHIJKtMOPQPSTimmB 

C ipher ABCDE3 ? GniJKl^IN0PQRSTUWXYZABGDEFGHI<lKIl®0PQPSTlMIXYZ 



The values given by this juxtaposition ore substituted for the letters 
of the cryptogram and the following results are obtained. 

Cryptogram ZCUIX AOYKX YYATQ 

1st Test —"Plain "text " ADVJY BPZLY ZZBUR 



This certainly is not intelligible text} obviously, the two components 
were not in the position indicated in this first test. The plain 
component is therefore slid one interval to the left, making Ac=Cp, and 
a second test is made. Thus 



Plain ABCDEFGHIJKLMNOPQRSTUVWXYZ 

C ipher ABCDEFGHIJTCLMNOPQPSTUVWXYZABCDEFGHIJKLMKrOPQRSTUVWXYZ 

Cryptogram ZCUIX AOYKX YYATQ 

2d Test — ■"Plain text’ 1 B E W K Z C Q A M Z A A C V S 



Neither does the second test result in disclosing any plain text. But, 
if the results of the two tests are studied a phenomenon that at first 
seems quite puzzling comes to light. Thus, suppose the results of the 
two tests are superimposed in this fashion. 

Cryptogram ZCUIX AOYKX YYATQ 

1st Test— "Plain text" ADVJY B P Z L Y ZZBUR 

2d Test "Plain text" BEWKZ CQAMZ AACVS 

^ One of the strips should bear the sequence repeated. This permits 
juxtaposing the two sequences at all 2 6 possible points of coincidence 
so as to have a complete cipher alphabet showing at all times. 
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(|j) Note what has happened. The net result of the two experiments 
was merely to continue the normal sequence begun by the cipher letters at 
the heads of the columns of letters. It is obvious that if the normal 
sequence is completed in each column the results will be exactly the same 
as though the whole set of 25 possible tests had actually been performed . 
Let the columns therefore be completed, as shown in Fig. 11. 

ZCUIXAOYKXYYATQ 
A D V JYBPZLYZZBUR 
BEWKZC QAMZAACV S 
CFXLADRBNABBDWT 
DGYMBESCOBCCEXU 
EHZNCFTDPCDDFYV 
FIAODGTJEQDEEGZW 
. G JBPEIVFEEFFHAX 
HKCQFIWGSFGGIBY 
ILDRGJXHTGHHJC Z 
JMESHKYIUHIIKDA 
KNFTILZJVIJJLEB 
LOGUJMAKWJKKMFC 
MPHVKNBLXKLLNGD 
NQIWLOCMYLMMOHE 
ORJXMPDNZMNNPIF 
PSKYNQEOANOOQJG 
QTLZORFPBOPPRKH 
RUMAPSGQCPQQSLI 
SVNBQTHRDQBRTMJ 
*TWOCRUISERSSUNK 
UXPDSVJTFSTTVOL 
YTQETWKUGTUtJWPM 
WZRFUXLVHUVVXQN 
XASGVYMWIVWWYRO 
YBTHWZNXJNXXZSP 

Figure 11. 

An examination of the successive horizontal lines of the diagram discloses 
one and only one line of plain text, that marked by the asterisk and read- 
ing TWOCRUISERSSUNK. 

(6) Since each column in Fig. 11 is nothing but a normal sequence, 
it is obvious that instead of laboriously xrriting down, these columns of 
letters every time a cryptogram is to be examined, it would be more con- 
venient to prepare a set of strips each bearing the normal sequence 
doubled (to permit complete coincidence for an entire alphabet at any set- 
ting), and have them available for examining any future cr yptograms . In 
using such a set of sliding strips in order to solve a cryptogram prepared 
by means of a single direct standard cipher alphabet, or to mke a test to 
determine whether a cryptogram has been so prepared, it is only necessary 
to "set up" the letters of the cryptogram on the strips, that is, align 
them in a single row across the strips (by sliding the individual strips 
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up or down). The successive horizontal lines, called generatrices (sin" 
gular, generatrix ) 5, are then examined, in a search for intelligible text. 

If the cryptogram really belongs to this simple type of cipher, one of 
the generatr 3 ces ■will exhibit intelligible text all the way across; this 
text trill practically invariably be the plain text of the message . This 
method of analysis may be termed a ioiut '.on by completing the plain - 
component sequence . Sometimes it is referred to as "running down” "* the 
sequence. The principle upon which the method is based constitutes one 
of the cryptanalysts most valuable tools.® 

b. The case of reversed standard alphabets . — (l) The method describ- 
ed under subpar. a may also be applied, in slightly modified form, in the 
case of a cryptogram enciphered by a single reversed standard alphabet. 

The basic principles are identical in the tiro cases, as will now be demon- 
strated. 

(2) Let two sliding components be prepared as before, except that in 
this case one of the components must be a reversed normal sequence, the 
other, a direct normal sequence. 

(3) Let the two components be juxtaposed A to A, as shorn below, 
and then let the resultant values be substituted for the letters of the 
cryptogram. Thus : 

CRYPTOGRAM 

IT K S 13 P MYOCP 0 0 MTU 

Plain ABCDEITGHIJKLMROPQRSTIRA-JXYZ 

C ipher ZYJGTVUrSRQPOM/lIKJinGFEDCBAZYXI-rVTITSRQPOITMLIKJIIKrlEDCBA 

Cryptogram————— HKSEP MYOCP 0 0 MTU 
1st Test— "Plain text 1 ' IT Q I IT L 0 C M Y L M M 0 H E 

(k) This does not yield intelligible text, and therefore the revers- 
ed component is slid one space forvard and a second test is made. Thus: 

Plain ABCDEJjXJHIJKLMITOPQRSTUIATXYZ 

Cipher ZYwmssR^m'tiKjmGmiicmzY^-NmsRc^Qmiii^mimw'BA 

Cryptogram ' HKSE° MYOCP 0 0 M T ¥ 

2d Test ’‘Plain text 11 0 R J X M P D II Z M IT IT P I F- 

( 5 ) Neither does the second, test yield intelligible text. But let 
the results of the two tests be superimposed. Thus: 

Cryptogram. HKSEP MYOCP OOMTII 

1st Test— "Plain text" N Q I ¥ L OCMIL M M 0 iIE 
2d Test ’’Plain text” ORJXM PDNZM HHPIP 



^ Pronounced: j&f' er-a-trx^s ez and jen^ er-a^triks , respectively. 

^ A set of heavy paper strips, suitable for use in completing the 
plain-component sequence, has been prepared for use as a training aid in 
connection with the courses in Military Cryptanalysis. 
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(6) It is seen that the letters ^ th« "plain text" given by the 
second trial are merely the continuants of* the normal sequences initiated 
by the letters of the "plain text" given “by the first trial. If these 
sequences are "run down" — that is, completed within the columns— the 
results must obviously he the same as though successive tests exactly 
similar to the first two were applied to the cryptogram, using one 
reversed normal and one direct normal component. If the cryptogram has 
really been prepared hy means of a single reversed standard alphabet, 
one of the generatrices of the diagram that results from completing the 
sequences must yield intelligible text. 

(T) Let the diagram be made, or better yet, if the student has 
already at hand the set of sliding strips referred to in footnote 6 to 
page 69, let him "set up" the letters given by the first trial. Fig* 12 
shows the diagram and indicates the plaintext generatrix. 

NKSEPMYOCPOOMTW 
NQIWLOCMYLMMOHE 
OR JXMPDNZMOPIF 
PSKYNQEOANOOQJG 
QTLZORFPBOPPRKH 

rumapsgqcRqqsli 

SVNBQTHRDQRRTMJ 
*TWOCRUISERSSURK 
UXPDSVJTFST1' VOL 
VYQETWKUGTUUWPM 
WZRFUXLVHUVVXQN 
XASGV*YMWIVWWYRO 
YBTHWZNXJWXXZSP 
ZCUIXAOYKXYYATQ 
ADVJYBPZLYZZBUR 
BEWKZC QAMZAACV S 
CFXLADRBNABBDWT 
DGYMBESCOBCCEXU 
EHZNCFTDPCBDFYV 
FIAOBGUEQBEEGZW 
GJBPEHVFREFFHAX 
HKCQFIWGSFGGIBY 
ILDRGJXHTGHHJCZ 
JMESHKYIUHIIKDA 
KNFTILZJVIJJLEB 
LOGUJ Mi A KWJKKMFC 
MPHVKNBLXKLLNGD 

Figure 12 . 

(8) The only difference in procedure between this case and the 
preceding one (where the cipher alphabet vas a direct standard alphabet) 
is that the letters of the cipher text a,re first "deciphered" by means 
of any reversed standard alphabet and then the columns are "run dosm", 
according to the normal ABC . * . Z sequence. For reasons which will 
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become apparent very soon, the first step in this method is technically 
termed converting the cipher letters into their plain-component 



lents ; the second step is the same as before, , 

component sequence* ~~ ”** 



SISEJPE 






35. Special remarks on the method of solution by completing the 



u ""£• The terms employed to designate the steps 
in the solution set forth in par. 34h(8), viz ., “converting the cipher 
letters into their plain-component equivalents" and "completing the plain- 
component sequence", accurately describe the process. Their meaning will 
become more clear as the sbudent progresses with the work. It may be said 
that whenever the components of a cipher alphabet are known sequences, no 
matter how they are composed, the difficulty and time required to solve 
any cryptogram involving the use of those components is considerably re- 
duced. In some cases this knowledge facilitates* and in other cases is 



the only thing that makes possible, the solution of a very short cryptogram 



that might otherwise defy solution. Later on an example will be given to 



illustrate what is meant in this regard. 



b. The student should take note, however, of two qualifying expres- 
sions that were employed .Ln a preceding paragraph to describe the results 
of the application of the method. It was stated that "one of the gener- 
atrices will exhibit intelligible text all the way across ; this text will 
practically invariably be the plain text . 11 Will there ever be a case in 
which more than one generatrix will yield intelligible text through its 
extent? That obviously depends almost entirely on the number of letters 
that are aligned to form a generatrix. If a generatrix contains but a 
very few letters, only five, for example, it may happen as a result of 
pure chance that there will be two or more generatrices showing what 
might he "intelligible text." Note in Fig. 11, for example, that there 
are several cases in which 3-letter and 4-letter English words (LAD, COB, 
MESH, MAPS, etc.) appear on generatrices that are not correct, these 
words being formed by pure chance. But there is not a single case, in this 
diagram, of a 5-letter or longer word appearing fortuitously, because 
obviously the longer the word the smaller the probability of its appear- 
ance purely by chance j and the probability that two generatrices of 15 
letters each will both yield intelligible text along their entire length 

is exceedingly remote, so remote, in fact, that in practical cryptology 
such a case may be considered nonexistent .7 

c. The student should observe that in reality there is no difference 
whatsoever in principle between the two methods presented in subpars. a 
and b of par, 34* In the former the preliminary step of converting the 
cipher letters into their plain-component equivalents is apparently not 
present but in reality it is there. The reason for its apparent absence 
is that in that case the plain component of the cipher alphabet is ident- 
ical in all respects with the cipher component, so that the cipher letters 

^ A person •sri.th patience and an inclination toward the curiosities of 
the science might construct a text of 15 or more letters which would yield 
two "intelligible" texts on the plain-component completion diagram. 
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require no conversion, or, rather, they are identical with the equivalents 
that would result if they were converted on the basis A c sAp. In fact, if 
the solution process had been arbitrarily initiated by converting the ci- 
pher letters into their plain-component equivalents at the setting Ac=Op, 
for example, and the cipher component slid one interval to the right 
thereafter, the results of the first and second tests of par. would 
be as follows: 

Cryptogram- — — ZCUIXAOYKXYYATQ 

1st Test— — 1 1 Plain text 11 — N Q I w" L 0 CHI LMHOHE 

2d Test "Plain text"— ORJXMPDNZMNNPIF 

Thus, the foregoing diagram duplicates in every particular the diagram 
restating from the first two tests under par. 3tb: a first line of cipher 

letters , a second line of letters derived from them but showing externally 
no rel . ionship with the first line, and a third line derived immediately 
from the second line by continuing the direct normal sequence. TM« '-»Oint 
is brought to attention only for the purpose of showing that a si: - ere, 
broad principle is the basis of the general method of solution by complet- 
ing the plain-component sequence, and once the student has this ’’irmly in 
mind he will have no difficulty whatsoever in realizing whe? . principle 
is applicable, what a powerful crypt analytic tool it can be, and what 
results he may expect from its application in specific instances. 

d. In the two foregoing examples of the application of the prin- 
ciplej the components were normal sequences; but it should be clear to 
the student, if he has grasped" what has been said in the preceding sub- 
paragraph, that these components may be mixed sequences which, if known 
(that is, if the sequence of letters comprising the sequences is known to 
the cryptanalyst), can be handled just as readily as can components that 
are normal sequences. 

e. It is entirely immaterial at what points the plain and the cipher 
components are juxtaposed in the preliminary step of converting the cipher 
letters into their plain-component e qul valent s . For example, in the case 
of the reversed alphabet cipher solved in par. 3^b, the two components 
were arbitrarily juxtaposed to give the value Ap=Ac, but they might have 
been juxtaposed at any of the other 25 possible points of coincidence 
without in any way affecting the final result, viz ., the production of 
one plaintext generatrix in the completion diagram. 

36. Value of mechanical solution as a short cut . — a. It is evident 
that the very first step the student should take in his~*attempts to solve 
an unknown cryptogram that is obviously a substitution cip h er is to try _ 
the mechanical method of solution by completing the plain-component se- 
quence, using the normal alphabet, first direct, then reversed. This 
hakes only a very few minutes and is conclusive in its results . It saves 
the labor and tx’ouble of constructing a frequency distribution in case 
the cipher is of this simple type. Later on it will be seen how certain 
variations of this simple type may also be solved by the application of 
this method. Thus, a very easy short cut to solution is afforded, which 
even the experienced cryptanalyst never overlooks in his first attack on 
an unknown cipher. 

gfjrpoTnrnwn 
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b. It Is important now to note that if neither of the two foregoing 
attempts is successful in bringing plain text to light and tho cryptogram 
is quite obviously monoalphabetic in character, the cryptanalyst is war " 
ranted in assuming fcliab the cryptogram involves a mixed cipher alphabet * 



37. Basic reason for the low degree of ; ypto security afforded by 
monoalpliabetic cryptograms involving standard cipher alphabets . — The 
student has jseen. that the solution of mono alphab e t ic cryptograms Involving 
standard cipher alphabets is a very easy matter, to methods of analysis 
were described, one involving the construction of a frequency distribu- 
tion, the other not requiring this kind of tabulation, being almost mech- 
anical in nature and correspondingly rapid. In the first of these two 
methods it was necessary to make a correct assumption as to the value of 
hut one of the 26 letters! of the cipher alphabet and the values of the 
remaining 25 letters at once became known; in the second method it was not 
necessary to assume a "Value for even a single cipher letter. The student 
should understand what constitutes the basis of this situation, viz ., the 
fact that the two components of the cipher alphabet are composed of known 
sequences . "What if one or "both of these components are, for the crypt- 
analyst, unknown sequences ? In other words, what difficulties will con- 
front the cryptanalyst if the cipher component of the cipher alphabet is 
a mixed sequence? Will such an alphabet be solvable as a whole at one 
stroke, or will it he necessary to solve its values individually? Since 
the determination of the value of one cipher letter in this case gives no 
direct clues to the value of any other letter, it would seem that the 
solution of such a cipher should involve considerably more analysis and 
experiment than has the solution of either of the two types of ciphers so 
far examined. The steps to he taken in the cryptanalysis of a mixed- 
alphabet cipher will he discussed in the next section. 

^ There is hut one other possibility, already referred to under 
subpar. 31d which involves the case where transposition and monoalphabetic 
substitution processes have been applied in successive steps. This is 
unusual, however, and will be dismissed in its proper place. 
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SECTION VI 



UN3XITERAL SUBSTITUTION WITH MIXED CIPHER ALPHABETS 



r - Paragraph 

Literal keys and numerical keys..* ........ 38 

Types of mixed cipher alphabets... ......... ......... 39 

Additional remarks on cipher alphabets 40 

Preliminary steps in the analysis of a monoalphabet ic, mixed- 

alphabet cryptogram. 4l 

Preparation of the work sheet *+2 

Triliteral frequency distributions 43 
Classifying the cipher letters into vowels and consonants........... 44 

Further analysis of the letters representing vowels and consonants.. 45 

Substituting deduced values in the cryptogram. 46 

Completing the solution 47 

General remarks on the foregoing solution 48 

The "probable-word" method; its value and applicability 49 

Solution of additional cryptograms produced by the same 

components 5° 

Derivation of key words 51 



38, Literal keys and numerical keys . — >a. As has been previously 
mentioned, most cryptosystems involve the use of a specific key to con- 
trol the steps followed in encrypting or decrypting a specific message 
(see subpar. 9b). Such a key may be in literal form or in numerical form* 



b. It is convenient to designate a key which is composed of letters 
as a literal key . As already mentioned, a literal key may consist of a 
single letter, a single word, a phrase, a sentence, a whole paragraph, or 
even a book; and, of course. It may consist merely of a sequence of let- 
ters chosen at random. 



c. Certain cryptosystems involve the use of a numerical key , which 
may consist of a relatively long sequence of numbers difficult or impos- 
sible for the average cipher clerk to memorize. Several simple methods 
for deriving such sequences from words, phrases, or sentences have been 
devised, and a numerical key produced by any of these methods is called a 
derived numerical key (as opposed to a key consisting of randomly-selected 
numbers)'. One of the commonly-used methods consists of assigning numer- 
ical values to the letters of a selected literal key in accordance with 
their relative positions in the ordinary alphabet, as exemplified in the 
following subparagraph. 
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d. Let the prearranged key word "be the word LOGISTICS. Since C, 
the penultimate letter of the key word, appears in the normal alphabet 
"before any other letter of the key word, it is assigned the number 1: 

LOGISTICS 

1 

The next letter of the normal alphabet that occurs In the key word is G, 
which is assigned the number 2 . The letter I, which occurs twice in the 
key word, is assigned the number 3 for its first occurrence and the num- 
her 4* for its second occurrence! and so on. The final result is; 

LOGISTICS 

5623791*18 

This method of assigning the numbers is very flexible and varies with 
different uses to which numerical keys are put. It may, of course, be 
applied to phrases or to sentences, so that a very long numerical key, 
ordinarly impossible to remember, may be thus derived at will from an 
easily-remembered key text . 

e. As far as the cryptanalyst is concerned, the derivation of a 

• numerical key from a specific literal key is of interest to him because 
this knowledge may assist in subsequent solutions of cryptograms prepared 
according to the same basic system, or in identifying the source from 
which the literal key was selected - perhaps an ordinary book, a magazine, 
etc. However, it should he pointed out that in some instances the crypt- 
analyst may be unaware that a literal key has in fact been used as the 
basis for deriving a numerical key. 

39 » Types of mixed cipher alphabets . — a. It will be recalled that 
in a mixed cipher alphabet the sequence of letters or characters in one 
of the components (usually the cipher component) does not correspond to 
the normal sequence. There are various methods of composing the sequence 
of letters or elements of this mixed component, and those which are based 
upon a scheme that is systematic in its nature are very useful because 
they make possible the derivation of one or more mixed sequences from any 
easily-remembered word or phrase, and thus do not necessitate the carry- 
ing of m*itten memoranda. Alphabets Involving a systematic method of 
mixing are called systematically-mixed cipher alphabets . 

b. One of the simplest types of systematically-mixed cipher alpha- 
bets Ts the keyword-mixed alphabet . The cipher component consists of a 
key word or phrase "(with repeated” letters , if present, omitted alter 
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their first occurrence )^, followed by the letters of the alphabet in 
their normal sequence (with letters already occurring in the key omitted 
of course). Example , with GOVERNMENT as the key word: 

Plain i ABCDEFGHIJKUvINOPQRSTUVWXYZ 

C ipher : GOVERIMTABCDm JKLPQSUWm 

c. It is possible to disarrange the sequence constituting the 
cipher component even more thoroughly by applying a simple method of 
transposition to the keyword-mixed sequence. Two common methods are 
illustrated below, using the key word [TELEPHONY. 

(1) Simple columnar transposition : 

'TELPHONY 
ABCDFGIJ 
K M Q"R S XI V W 
X z 

Mixed sequence (formed by transcribing the successive columns from left 
to right): 

TAKXEBMZLC QFDRHFSOGUNIVYJW 

(2) Numerically-keyed" columnar transposition : 

’ ‘ 7-1-3-6-2-5-4-8 

TELPHONY 
ABCDFGIJ 
KMQRSUVW 
X Z 

Mixed sequence (formed by transcribing the columns in a sequence deter- 
mined by the numerical key derived from the key word itself) : 

EBMZHFSLC QNIVOGUPDRTAKXYJW 



Mixed alphabets formed by including all repeated letters of the 
key word or key phrase in the cipher component were common in Edgar 
Allan Poe's day but are impractical because they are ambiguous, making 
decipherment difficult; an example : 

ABCDEFGHIJXIMIOPQRSTUVNXYZ 
NOWISTHETIMEFORALLGOODMENT 

ABCDEFGHIJKIMNOPQRSTUVWXXZ 
P VHMSGD QKAB OEF C 

L J HWYN I 

X T Z 

tr 

The average cipher clerk would have considerable difficulty in decrypting 
a cipher group such as TOOET, each letter of which has three or more 
equivalents, and from which the plaintext fragments (N)lNTH,, ..IT THI(S) 
IT THI..., etc. can be formed on decipherment. 



Plain: 

(a) Alphabet for enciphering.— Cipher: 

Cipher: 

(b) Inverse form of (a). Plain: 

for deciphering, — — 
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d. The last two systematically-mixed sequences are examples of 
transposition-mixed sequences . Almost any method of transposition may be 
used to produce such sequences. 

e. Another simple method of forming a mixed sequence is the deci - 
mation method . In this method, letters in the normal alphabet, or in a 
keyword-mixed sequence, are "counted off” according to any selected in- 
terval. As each letter is decimated— that is, eliminated from the basic 
sequence by counting off — it is entered in a separate list to form the 
new mixed sequence. For example, to form a mixed sequence by this xnethod| 
from a keyword-mixed sequence based on the key phrase SING A SONG OF 
SIXPENCE with 7 the interval selected, proceed as follows: 

Keyword-mixed (or basic) sequence: 

SINGAOFXKCBDHJKI^QRTUVWYZ 

When the letters are counted off by 7 'a from left to right, F will be the 
first letter arrived at, H the second, T the third: 

SINGAO^XPECBDjfjKLMQRtfUVWYZ 

123^56ll23456jl23^567 

These letters are entered in a separate list (F first, H second, T third, 
and so on) and eliminated from the keyword-mixed sequence. When the 
end of the keyword-mixed sequence is reached, return to the beginning, 
skipping the letters already eliminated: 

S^NGAO^XPjdCBDjfJKLytQRTfUVWYZ 

12345 

621234 563123 ^563 

The decimation-mixed sequence: 

FIfflEJIZPQIOTCVBSLXAGOKYJRU 

f. Practical considerations, of course, set a limit to the complex- 
ities that may be introduced in constructing systematically-mixed alpha- 
bets. Beyond a certain point there is no object in further mixing. The 
•greatest amount of mixing by systematic processes will give no more se- 
curity than that resulting from mixing the alphabet by random selection, 
such as by putting the 26 letters in a box, thoroughly' shaking them up, 
and then drawing the letters out one at a time. Whenever the laws of 

■ chance operate in the construction of a mixed alphabet, the probability 
of producing a thorough disarrangement of letters is very great. Random - 
mixed alphabets give more cryptographic security than do the less compli- 
cated syBtematically-mixed alphabets, because they afford no clues to 
positions of letters, given the position of a few of them. Their chief 
disadvantage is that they must be reduced to writing, since they cannot 
readily be remembered, nor can they be reproduced at will from an easily- 
remembered key word. 
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fro. Additional remarks on cipher alphabets , — a. All cipher alpha- 
bets may he classified on the "basis of their arrangement as enciphering 
or deciphering alphabets . An enciphering alphabet is one in which the 
sequence of letters in the plain component coincides with the normal 
sequence and is arranged in that manner for convenience in encipherment. 
In a deciphering alphabet the sequence of letters in the cipher component 
coincides with the normal, for convenience in deciphering. For example, 
(l), below, shows a mixed cipher alphabet arranged as an enciphering al- 
phabet; (2) shows the corresponding deciphering alphabet. An enciphering 
alphabet and its corresponding deciphering alphabet present an inverse 
relationship to each other. To invert a deciphering alphabet is to write 
the corresponding enciphering alphabet; to invert an enciphering alphabet 
is to write the corresponding deciphering alphabet. 



Enciphering Alphabet 

Plain: ;0CDEFGniJKLMNOPQRSTUVWXYZ 

' ■ 1 ' C ipher : .JKQVXZWESTRMJIOLGAPHCMYBDF 



Deciphering Alphabeb 



( 2 ) 



C ipher : ABCDEFGHIJKtiMOPQRSTUVWXyZ 

Plain: RXUIHZQTWABIVLOSCKIJMDGEWF 



b, A series of related reciprocal alphabets maybe produced by 
juxtaposing at all possible points of coincidence two components which 
are identic a!, but progress in opposite directions. This holds regardless 
of whether tlie components are composed of an even or an odd n umb er* of 
elements. The following reciprocal alphabet is one of such a series of 
26 alphabets: 

Plain: 'H^^ULICTEFGJKMOPQGTVWXZ 

- Cipher: GFEBC ILUARDYHZXWVTSQPONMKJ 

A single or isolated reciprocal alphabet may be produced in one of two 
ways: 



(1) By constructing a complete reciprocal alphabet by arbitrary or 

random assignments of values in pairs. That is, if Ap is made the 
equivalent of K c , then ICp is made the equivalent of A,,; if Bp is made R c , 
then Rp is made B c , and so on. If the two components thus constructed 
are slid against each other no additional reciprocal alphabets will be 
produced. _ * 

(2) By juxtaposing a sequence coinprising an even number of elements 
against the same sequence shifted exactly half way to the right (or left), 
as seen below: 



ABCDEFGHIJKLMNOPQRSTUVWXYZ, 

ABCDEFGHIJKLMNOPQRS'riJTWXYZABCDEFGHIJICLMNOPQRSTUVWXYZ 
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hi. Freliminary steps in the a n alysis of a monoalphabetic , mixed - 
alphabet cryptogram . — a. The student is now ready to resume Ms crypt- 
analytic studies. Note the following cryptogram: 



SFDZF IOGEL PZFGZ DYSFF HBZDS GVHTF UPLVD FGYVJ VFVHT GADZZ AIT YD ZYFZJ 
22TGFT VTZBD VFHTZ DFXSB GIDZY VTXOI YVTEF VMGZZ THLLV XZDFM HTZAI TYDZY 
BDVFH TZDFK ZDZZJ SXISG ZYGAV FSLGZ DTHHT CDZRS VTYZD OZFFH TZAIT YDZYG 
AVDGZ ZTKHI TYZYS DZGHG ZFZTG UPGDI XWGHX ASRUZ DFUID EGHTV EAGXX 



b. A casual inspection of the text discloses the presence of sev- 
eral Tong repetitions as well as of many letters of normally low fre- 
quency, such as F, G, V, X, and Z; on the other hand, letters of normally 
high frequency, such as the vowels, and the consonants N and R, are rel- 
atively scarce. The cryptogram is obviously a substitution cipher and 
the usual mechanical tests for determining whether it is possibly of the 
monoalphabetic, standard- alphabet type are applied. The results being 
negative, a uniliteral frequency distribution is immediately constructed, 
as shown in Figure 13, and the <j> test is applied to it. , 



g § 

A B 

8 4 
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LMNOPQRSTUVWXYZ 

S 2 0 3 5 0 2 10 22 6 10 1 8 14 35 



4^=3668 <ji r =2117 4>o=3862 

Figure 13* 



c. The fact that the frequency distribution shows very marked 
crestsT and troughs indicates that the cryptogram is very probably mono- 
alphabetic, and the results of the <j> test further support this hypothesis. 
The fact that the cryptogram has already been tested by the method of 
completing the plain-component sequence and found not to he of the mono- 
alphabetic, standard-alphabet type, indicates with a high degree of 
probability that it involves a mixed cipher alphabet. A few moments might 
be devoted to making a careful inspection of the distribution to insure 
that it cannot be made to fit the normal; the object of this would be to 
rule out the possibility that the text resulting from substitution by a 
standard cipher alphabet had not subsequently been transposed. But this 
inspection in this case is hardly necessary, in view of the presence of 
long repetitions in the message. 2 (See subpar, 25g.) 



2 This possible step is mentioned here for the purpose of making it 
deal' that the plain-component sequence completion method cannot solve a 
case in which transposition has followed or preceded monoalphabetic sub- 
stitution with standard alphabets . Cases of this kind will be discussed 
in a later text. It is sufficient to indicate at this point that the - 
frequency distribution for such a combined substitution- transposition 
cipher would present the characteristics of a standard alphabet cipher 
and yet the method of completing the plain-component sequence would fail 
to bring out any plain text. 
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a. One might, of course, attempt to solve the cryptogram by applying 
the simple principles of frequency. One might, in other words, assume 
that Z c (the letter of greatest frequency) represents Ep, D c (the letter 
of next greatest frequency) represents Tp, ana so on. If the message 
were long enough this simple procedure might more or less quickly give 
the solution. But the message is relatively short and many difficulties 
would he encountered. Much time and effort would be expended unnecessar- 
ily, because it is hardly to be expected that in a message of only 235 
letters -the relative order of frequency of the various cipher letters 
should exactly coincide with, or even closely approximate the relative 
order of frequency of letters of normal plain. text found in a count of 
50,000 letters. It is to be emphasized that the beginner must repress 
the natural tendency to place too much confidence in the generalized prin - 
ciples of frequency and to rely too much upon them . It is far better to 
bring into effective use certain other data concerning normal plain text, 
such as digraphic and trigraphic frequencies, 

l>-2. Preparation of the work sheet . — a. The details to be- considered 
in this paragraph may at first appear to be superfluous, but long expe- 
rience has proved that systematization of the work and preparation of the 
data in the most utilizable, condensed form is most advisable, even if 
this seems to take considerable time. In the first place, if it merely 
serves to avoid interruptions and irritations occasioned by failure to 
have the data in an instantly available form, it will pay by saving men- 
tal wear and tear. In the second place, especially in the case of com- 
plicated cryptograms, painstaking care in these details, while it may not 
always bring about success, is often the factor that is of greatest 
assistance in ultimate solution. The detailed preparation of the data 
may be irksome to the student, and he may be tempted to avoid as much of 
it as possible, but, unfortunately, in the early stages of solving a 
cryptogram he does not know (nor, far that matter, does the expert always 
know) Just which data are essential and which may be neglected. Even 
though not all of the data may turn out to have been necessary, as a gen- 
eral rule, time is saved in the end if all the usual data are prepared as 
a regular preliminary to the solution of most cryptograms, 

b. First, the cryptogram is recopied in the form of a work sheet . 
This sheet should be of a good quality of paper so as to withstand con- 
siderable erasure . If the cryptogram is to be copied by hand, cross- 
section paper of iy-inch squares is extremely useful. The writing should 
be in ink, and plain, carefully-made roman capital letters should be used 
in all cases .3 If the cryptogram is to be copied on a typewriter, the 
ribbon employed should be impregnated with an ink that will not smear or 
smudge under the hand, 

. 1 

It is advisable to use, for this purpose, the system of standardized 
manual printing adopted by Service communications personnel. The vise of 
this system, which is included in Appendix 7, assures that work sheets 
are completely legible, not only to the person preparing them, but to 
others as well. 
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c. The arrangement of the characters of the cryptogram on the work 
sheet is a matter of considerable importance. If the cryptogram as first 
Obtained is in groups of regular length (usually five characters to a 
group) and if the imiliteral frequency distribution shows the cryptogram 
to be monoalphabetic , the characters should be copied without regard to 
this grouping. It is advisable to allow one space between letters (this 
is especially true for work sheets prepared on the typewriter), and to 
write a constant number of letters per line, approximately 25. At least 
two spaces, preferably three spaces, should be left between horizontal 
lines, to allow room for multiple assumptions. Care should be taken to 
avoid crowding the letters in any case, for this is not only confusing to 
the eye but also mentally irritating when later it is found that not 
enough space has been left for making various sorts of marks or indica- 
tions . If the cryptogram is originally in what appears to be word lengths 
(and this is the case, as a rule, only with the cryptograms of amateurs), 
naturally it should be copied on the work sheet in the original group- 
ings , If further study of a cryptogram shows that some special grouping 
is required, it is often best to recopy it on a fresh work sheet rather 
than to attempt to indicate the new grouping on the old work sheet. 

d. In order to be able to locate or refer to specific letters or 
groupls of letters with speed, certainty, and without possibility of con- 
fusion, it is advisable to vise coordinates applied to the lines and 
columns of the text as it appears on the work sheet. To minimize possi- 
bility of confusion, it is best to apply letters to the horizontal lines 
of the text, numbers to the vertical columns. In referring to a letter, 
the horizontal line in which the letter is located is usually given first. 
Thus, referring to the work sheet shown below, coordinates A17 designate 
the letter Y, the 17th letter in the first line. The letter I is usually 
omitted from the series of line indicators so as to avoid confusion with 
the figure 1. If lines are limited to 25 letters each, then each set of 
100 letters of the text is automatically blocked off by remembering that 

4 lines constitute 100 letters. 

e. Above each character of the cipher text may be some indication 
of the frequency of that character in the whole cryptogram. This indi- 
cation may he the actual number of times the character occurs, or, if 
colored pencils are used, the cipher letters may he divided up into three 
categories or groups — high-frequency,- medium-frequency, and loir- frequency. 
It 'is perhaps simpler, if clerical help Is available, to indicate the 
actual frequencies. This saves constant reference to the frequency 
tables, which interrupts the train of thought, and saves considerable 
time In the end, since it enables the student better to visualize fre - 
quency-pat herns of words. In any case, it is recommended that the fre- 
quencies of the letters comprising the repetitions be inscribed over their 

1 In some cryptosystems, certain low-frequency letters are employed as 
word separators to indicate the end of a word; if the meaning of these 
letters is discovered, it is tantamount to having the cryptogram in word 
lengths and thus the work sheet is made accordingly. See also in this 
connection the treatment on word separators in Section VII . 
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respective letters j likewise, the frequencies of the first 10 and last 10 
letters should also he inscribed, as these positions often lend themselves 
readily to attack 

f . After the special frequency distribution, explained in Par* ^3 
below, has been constructed, repetitions of digraphs and trigraphs should 
be underscored. In so doing, the student should be particularly watchful 
for trigraphic repetitions which can be further extended into tetragraphs 
and polygraphs of greater length. Repetitions of more than ten charac- 
ters should be set off by heavy vertical lines, as they indicate repeated 
phrases and are' of considerable assistance in solution. If a repetition 
continues from one line to the next, put an arrow at the end of the under- 
score to signal this fact. Reversible digraphs and trigraphs should also 
be indicated by an underscore with an arrow pointing in both directions • 
Anything which strikes the eye as being peculiar, unusual, or significant 
as regards the distribution or recurrence of the characters should be 
noted. All these marks should, if convenient, be made with ink so as not 
to cause smudging. The work sheet will now appear as shown below (not 
all the repetitions are underscored): 





i 


2 


3 


4 


5 


0 


7 


8 


9 


10 


11 


12 


13 


14 


15 


10 


17 


18 


19 


20 


21 


22 


23 


24 


25 




IQ 


19 


23 


35 


10 


10 


3 


10 


15 


5 


5 


35 


19 


19 


35 


23 


14 


10 


5 


19 


15 


4 


36 


23 


10 


A 


s 


F 

«— 


D 


Z 
— > 


F 


I 


0 


G 


H 


L 


P 


z 


F 


G 


z 


D 


J 


S 


P 


F 


H 


B 


Z 


D 


S 




10 


10 


15 


22 


19 


5 


5 


6 


10 


23 


19 


19 


14 


10 


3 


10 


19 


10 


15 


22 


19 


8 


23 


35 


35 


B 


G 


V 


H 


T 


F 


u 


p 


L 


V_ 


JD 


F 


G 


Y 


V 


J 


V 


F 


V 


H 


T 


G 


A 


D 


Z 


z 




8 


10 


22 


14 


23 


35 


14 


19 


35 


3 


35 


22 


19 


5 


22 


10 


22 


35 


4 


23 


10 


19 


16 


22 


35 


C 


A 


I 


T 


Y 


D 


z 


Y 


F 


z 


J 


z 


T 


G 


p 


T 


V 


T 


z 


B 


D 


V 


F 


H 


T 


z 




23 


19 


8 


10 


4 


— * 
19 


10 


23 


35 


14 


16 


22 


8 


3 


10 


14 


10 


22 


3 


19 


18 


2 


19 


35 


35 


D 


D 


F 


X 


s 


33 


G 


I 


D 


z 


Y 


V 


T 


X 


0 


I 


Y 


V 


T 


E 


F 


V 


M 


G 


z 


z 


’ 


22 


15 


6 


5 


10 


8 


35 


23 


19 


2 


15 


22 


36 


8 


10 


22 


14 


23 


35 


14 


4 


23 


10 


19 


15 


E 


T 


H 


L 


L 


V 


X 


z 


D 


F 


M 


H 


T 


z 


A 


I 


T 


Y 


D 


z 


Y 


B 


D 


V 


F 


H 




22 


35 


23 


19 


2 


35 


23 


35 


35 


3 


10 


8 


10 


10 


19 


35 


14 


19 


8 


18 


19 


10 


6 


19 


35 


F 


( T 


_z. 


_D 


_F 


K 


z 


D 


z 


z 


J 


S 


X 


I 


S 


G 


z 


Y 


G 


A 


V 


F 


S 


L 


G 


z 




23 


22 


16 


15 


22 


1 


23 


35 


2 


10 


10 


22 


14 


35 


23 


8 


35 


19 


19 


15 


22 


35 


8 


10 


22 


G 


D 


T 


H 


H 


T 


c 


D 


z 


R 


s 


V 


T 


Y 


z 


D 


0 


z 


F 


F 


H_ 


X 


_z_ 


JL 


X 


X 




H 


23 


35 


14 


19 


8 


18 


23 


19 


35 


35 


22 


2 


15 


10 


22 


14 


35 


14 


10 


23 


85 


19 


15 


5 


H 


( Y 


D 


z 


Y 


G 


A 


V 


D 


G 


z 


z 


T 


K 


H 


I 


T 


Y 


z 


Y 


S 


D 


z 


G 


H 


u 




3 5 


19 


35 


22 


19 


5 


5 


19 


23 


10 


8 


l 


19 


15 


8 


8 


10 


2 


5 


35 


23 


19 


5 


10 


23 


J 


z 


F 


z 


T 


G 


u 


P 


G 


D 


I 


X 


ff 


G 


H 


X 


A 


S 


R 


u 


z. 


_D 


J! 


u 


I 


D 


~~ .. 


3 


19 


15 


22 


18 


3 


8 


19 


8 


8 
































K 


& 


G 


H 


T 


V 


E 


A 


G 


X 


X 

































** See Appendix 4 in this connection. 
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f t3 « Tr illter al frequency dis bributions , — a, Jn what has gone "before, 
a type of frequency distribution known as“a uniliteral frequency distri- 
bution was used. This, of course, shows only the number of times each 
individual letter occurs . In order to apply the normal digraphic and 
trigraphic frequency data (given in Appendix 2) to the solution of a 
cryptogram of the type now being studied, it is obvious that the data 
with respect to digraphs and trigraphs occurring in the cryptogram should 
be compiled and should be compared with the data for normal plain text. 

In order to accomplish this in suitable manner, it is advisable to con- 
struct a more comprehensive form of distribution termed a triliteral 
frequency distribution .” ” 

b. Given a cryptogram of 50 or more letters and the task of deter- 
mining what trigraphs are present in the cryptogram, there are three ways 
in which the data may be arranged or assembled. One may require that the 
data show (l) each letter with its two succeeding letters; (2) each let- 
ter with its two preceding letters; (3) each letter with one preceding 
letter and one succeeding letter. 

c . A dd stribution of the first of the three foregoing types may be 
designated as a “triliteral frequency distribution showing two suffixes"; 
the second type may be designated as a “triliteral frequency distribution 
showing two prefixes" ; the third type may be designated as a “triliteral 
frequency distribution showing one prefix and one suffix." Quadriliteral 
and pentaliteral frequency distributions may occasionally be found useful. 

d. Which of these three arrangements is to he employed at a specific 
time depends largely upon what the data are intended to show. For present 
purposes, in connection with the solution of a monoalphabetic substitution 
cipher employing a mixed alphabet, possibly the third arrangement, that 
showing one prefix and one s 'affix, is most satisfactory. 

e. It is convenient to use -jj-inch cross-section paper for the con- 
struction of a triliteral frequency distribution in the form of a distri- 
bution showing crests and troughs, such as that in Figure l4. In that 
figure the prefix to each letter to be recorded is inserted in the left 
half of the cell directly above the cipher letter being recorded; the 
suffix to each letter is inserted in the right half of the cell directly 
above the letter being recorded; and in each case the prefix and the suf- 
fix to the letter being recorded occupy the same cell, the prefix being 
directly to the left of the siiffix. The number in parentheses gives the 
total frequency for each letter. 



It is felt advisable here to distinguish between two closely related 
terms. A triliteral distribution of A B C D E F would consider the 
groups A B C, B C D, C D E, D E F; a trigraphic distribution would con- 
sider only the trigraphs ABC ana DE F, (See also subpar. 23d.) 
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CONDENSED TABLE OF REPETITIONS 






Digraphs 




Trigraphs 


Longer Polygraphs 


DZ-9 


TZ— 5 


VF-4 


DZY-4 


FHT-3 


HTZAITYDZY— 2 


ZD-9 


TY— 5 


VT-4 


HTZ-4 


TYD-3 


BDVFHTZDF-2 


HT-8 


FH— 4 


ZF-4 


ITY-4 


YDZ— 3 


ZAITYDZY— 3 


ZY-6 


GH-4 


ZT-4 


ZDF-4 


ZAI-3 


FHTZ— 3 


DF-5 


IT-4 


ZZ-4 


AIT-3 






GZ-5 













UD 

FT 

UF 

DG 

YY 

ZT 

GZ 

DY 

TA 

OF 

YD 



DR 









,IE 










































GD 








ZF 
































HV 










GY 








GI 
































ZG 










ZJ 








SZ 
































IY 










DZ 




, 




VG. 




DU 


AX 


























ZK 




- 






KD 




■ 




YZ 




ZZ 


EH 


























IY 










TD 






, 


ZO 




FH 


WH 


























HZ 






» 




DY 






1 


CZ 




ZF 


PD 
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f . The triliteral frequency distribution is now to he examined with 
a view to ascertaining what digraphs and trigraphs occur two or more 
times in the cryptogram. Consider the pair of columns containing the 
prefixes and suffixes to D c in the distribution, as shown in Fig. 14. 

This pair of columns shows that the following digraphs appear in the 
cryptogram: 



Digraphs based on prefixes 
(arranged as one reads up 
the column) 



Digraphs based on suffixes 
(arranged as one reads up 
the column) 



FD, ZD, 2D, VD, AD, 
ZD, ID, ZD, YD, BD, 
ZD, CD, ZD, YD, VD, 
ZD, ID 



YD, 


BD, 


DZ 


ZD, 


ZD, 


DF 


SD, 


GD, 


DT 


. 




df' 



DY, DS, DF, DZ, DZ, DV, 

DZ, DF, DZ, DV, DF, DZ, 
DZ, DO, DZ, DG, DZ, DI, 
DE 



The nature of the triliteral frequency distribution is such that in find- 
ing what digraphs are present in the cryptogram it is immaterial whether 
the prefixes or the suffixes to the cipher letters are studied, so long 
as one is consistent in the study . For example, in the foregoing list of 
digraphs based on the prefixes to D c , the digraphs FD, ZD, ZD, VD, etc., 
are found; If now, the student will refer to the suffixes of F c , z c , v c , 
etc., he will find the very same digraphs indicated. This being the case, 
the question may be raised as to what value there is In listing both the 
prefixes and the suffixes to the cipher letters. The answer is that by 
so doing the trigraphs are indicated at the Bame time. For example, in 
the case of D c , the following trigraphs are indicated: 



FDZ, ZDY, ZDS, VDF, ADZ, YDZ, BDV, ZDF, IDZ, ZDF, YDZ, BDV, ZDF, 

ZDZ, ZDT, CDZ, ZDO, YDZ, VDG, SDZ, GDI, ZDF, IDE. 

g. The repeated digraphs and trigraphs can now be found quite read- 
ily. Thus, In the case of D c , examining the list of digraphs based on 
suffixes, the following repetitions are noted: 



DZ appears 9 times; DF appears 5 times; DV appears 2 times 



Examining the trigraphs with D c as central letter, the following repeti- 
tions are noted: 



ZDF appears 4 times; YDZ appears 3 times; BDV appears 2 times 

h. It is unnecessary, of course, to go through the detailed proce- 
dure set forth in the preceding subparagraphs in order to find all the 
repeated digraphs and trigraphs . The repeated trigraphs with D c as cen- 
tral letter can be found merely from an Inspection of the prefixes and 
suffixes opposite D c in the distribution. It is necessary only to find 
those cases in which two or more prefixes are Identical at the same time 
that the suffixes are identical. For example, the distribution shows at 
once that in four cases the prefix to D c is Z c at the same time that the 
suffix to this letter is F c . Hence, the trigraph ZDF appears four times. 
The repeated trigraphs may all be found in this manner. 
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CONDENSED TABLE OF REPETITIONS 
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Trigraphs 


Longer Polygraphs 
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£• The triliteral frequency distribution is now to he examined with 
a view to ascertaining what digraphs and trigraphs occur two or more 
times in the cryptogram. Consider the pair of columns containing the 
prefixes and suffixes to D c in the distribution, as shown in Fig. l4. 

This pair of columns shows that the following digraphs appear in the 
cryptogram; 



Digraphs based on prefixes 
(arranged as one reads up 
the column) 



Digraphs based on suffixes 
(arranged as one reads up 
the column) 



ED, ZD, 2D, VD, AD, 
ZD, ID, ZD, YD, BD, 
ZD, CD, ZD, YD, VD, 
ZD, ID 



YD, 


BD, 


DZ 


ZD, 


ZD, 


DF 


SD, 


GD, 


DT 


■ 




DF 



DY, D3, DF, DZ, DZ, DV, 

DZ, DF, DZ, DV, DF, DZ, 
DZ, DO, DZ, DG, DZ, DI, 
DE 



The nature of the triliteral frequency distribution is such that in find- 
ing what digraphs are present in the cryptogram it is immaterial whether 
the prefixes or the suffixes to the cipher letters are studied, so long 
as one is consistent in the study . For example, in the foregoing list of 
digraphs based on the prefixes to D e , the digraphs FD, ZD, ZD, VD, etc., 
are found; if now, the student will refer to the suffixes of F c , z c t Vet 
etc., he will find the very same digraphs indicated. This being the case, 
the question may be raised as to what value there is in listing both the 
prefixes and the suffixes to the cipher letters. The answer is that by 
so doing the trigraphs are indicated at the same time. For example, in 
the case of D c , the following trigraphs are indicated: 



FDZ, ZDY, ZDS, VDF, ADZ, YDZ, BDV, ZDF, IDZ, ZDF, YDZ, BDV, ZDF, 

ZDZ, ZDT, CDZ, ZDO, YDZ, VDG, SDZ, GDI, ZDF, IDE. 

jg. The repeated digraphs and trigraphs can now be found quite read- 
ily. Thus, in the case of D c , examining the list of digraphs based on 
suffixes, the following repetitions are noted; 

DZ appears 9 times; DF appears 5 times; DV appears 2 times 

Examining the trigraphs with D c as central letter, the following repeti- 
tions are noted: 



ZDF appears 4 times; YDZ appears 3 times; BDV appears 2 times 

h. It Is unnecessary, of course, to go through the detailed proce- 
dure set forth In the preceding subparagraphs in order to find all the 
repeated digraphs and trigraphs . The repeated trigraphs with D c as cen- 
tral letter can be found merely from an inspection of the prefixes and 
suffixes opposite D c in the distribution. It is necessary only to find 
those cases in which two or more prefixes are identical at the same time 
that the suffixes are identical. For example, the distribution shows at 
once that in four cases the prefix to D c is Z c at the same time that the 
suffix to this letter is F c , Hence, the trigraph ZDF appears four times. 
The repeated trigraphs may all be found in this manner. 
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i. The most frequently repeated digraphs and trigraphs are then 
assembled in what is termed a condensed table of repetitions , so as bo 
"bring this information pro'minenbly "before the eye. As a rule, in mes- 
sages of average length, digraphs which occur less than four or five 
times, and trigraphs which 'occur less than three or four times may be 
omitted from the condensed table as being relatively of no importance in 
the study of repetitions. In the condensed table the frequencies of the 
individual letters forming the most important digraphs, trigraphs, etc,, 
should bo indicated, 

4-4, Classifying the cipher letters into vowels and consonants ,— 

a. Before proceeding to a' detailed analysis of the repeated digraphs and 
trigraphs, a very important step can be taken which will be of assistance 
not only in the analysis of the repetitions but also in the final solu- 
tion of the cryptogram. This step concerns the classification of the 
high-frequency cipher letters into two groups— (l) those which most prob- 
ably represent vowels, and (2) those which most probably represent con- 
sonants, For if the cryptanalyst can quickly ascertain the equivalents 
of the four vowels. A, E, I, and 0, and of only the four consonants, N, 

R, S, and T, he will then have the values of approximately two-thirds of 
all the cipher letters that occur in the cryptogram; the values of the 
remaining letters can almost be filled in automatically. 

b. The basis for the classification will be found to rest upon a 

comparatively simple phenomenon: the associational or combinatory be- 

havior of vowels is, in general, quite different from that of consonants. 
If an examination be made of Table 7-B in Appendix 2, showing the rela- 
tive order of frequency of the l8 digraphs composing 25 percent of Eng- 
lish telegraphic text, it will be seen that the letter E enters into the 
composition of 9 of the l8 digraphs; that is, in exactly half of all the 
cases' the letter E is one of the two letters forming the digraph. The 
digraphs containing are as follows: 

ED EN ER ES 

HE RE SE TE VE 

The remaining nine digraphs are as follows: 

AN HD OR ST 

HI HI TH 

ON TO 

c. None of the i8 digraphs is a combination of vowels . Note now 
that of the 9 combinations with E, 7 sure with the consonants N, R, S, and 
T, one is with D, one is with V, and none is with any vowel . In other 
words, Ep combines most readily with consonants but not with other vowels, 
or even with itself. Using the terms often employed in the chemical ana- 
logy, E shows a great "affinity" for the consonants N, R, S, T, but not 
for the vowels. Therefore, if the letters of highest frequency occurring 
in a given cryptogram are listed, together with the number of times each 
of them combines with the assumed cipher equivalent of Ep, those which 
show considerable combining power or af C*inity for the cipher equivalent 
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of Ep may "be assumed to be the cipher equivalents of N, &, S, Tp; those 
which do not show any affinity for the cipher equivalent of Ep may he as- 
sumed to he the cipher equivalents of A, I, 0, Up. Applying these prin- 
ciples to the problem in hand, and examining the triliteral frequency 
distribution, it is quite certain that Zc=Ep, not only because Z c is the 
letter of highest frequency, but also because it combines with several 
other high-frequency letters, such as D c , F c , G c , etc. The nine letters 
of next highest frequency are: 

23 22 19 19 16 15 Ik 10 10 
DTFGVHYS I 

Let the combinations these letters form with Z c be indicated in the fol- 
lowing manner: 

Number of times Z c s 
occurs as prefix — ^ s 

Cipher Letter D(23) T(22) F(19) G(l9) V(l6) H(l5) Y(l4) S(io) l(io) 

Number of times Z c 0 0 — 0 

occurs as suffix — - s 

d. Consider D c . It occurs 23 times in the message and 18 of those 
times - it is combined with Z c , 9 times in the form Z C D C (=E9p), and 9 times 
in the form DcZc (=©Ep). It is clear that D c must be a consonant. In 
the same way, consider T c , which shows 9 combinations with Z c , h in the 
form Z c Tc (=E0p) and 5 in the form T C Z C (=0Ep) . The letter T c appears to 
represent a consonant, as do also the letters F c , G c , and Y c . On the 
other hand, consider V c , occurring In all 1 6 times but never in combina- 
tion with Z c j it appears to represent a vowel, as do also the letters Hq, 
S c , and I c . So far, then, the following classification would seem logical: 

Vowels Consonants 

Z c (=Ep), Vq, H c , S c > I c D c , T c , F c , G c , Y c 

4-5. Further analysis of the letters representing vowels and conso - 
nants . — a. Op is usually the vowel of second highest frequency. Is it 
possible - to determine which of the letters V, H, S, I c is the cipher 
equivalent of Op? Let reference be made again to Table 6 in Appendix 2, 
where it is seen that the 10 most frequently occurring diphthongs are: 

Diphthong 10 OU EA El AI IE AU EO AY UE 

Frequency- 4l 37 35 27 17 13 13 12 12 11 

If V, H, S, I c are really the cipher equivalents of A, I, 0, Up (not 
respectively), perhaps it is possible to determine which is which by 
examining the combinations they make among themselves and with Z c T= E p) • 
Let the combinations of V, H, S, I, and Z that occur in the message' be 
listed. There are only the following: 

ZZ C — 4 VH C — 2 KH C — 1 HI C — 1 IS C ~ 1 SV C — 1 

ZZ C is of course EE^. Note the doublet HH C ; if H c is a vowel, then the 
chances are excellent that II C zO-n because the doublets AA-p, Up, UUp, are 
practically non-existent, whereas the double vowel combination 00 p is of 
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next highest frequency to the double vowel combination EEp. If H c rOp, 
then Vc must he Ip because the digraph VHc occurring two times in the 
message could hardly he AOp, or UOp, whereas the dip thong I0p is the one 
of high frequency in English, So far then, the tentative (because so far 
unverified) results of the analysis are as follows: 

Zc“Ep H c =Op V c =I p 

This leaves only two letters, I c and S c (already classified as vowels) to 
he separated into Ap and Up. Note the digraphs: 

HIc-OQp - I3 c :e© p SV c = 0 I p 

Only two alternatives are open: 

(1) Either I c rAp and S c =tjp, 

( 2 ) Or I c =Up and S G — Ap« 

If the first alternative is selected, then 

BTcSOAp IS c sAU p SV c =UI p 

If the second alternative is selected, then 

KE c =OUp IS c =UAp SV c =AIp 

The eye finds it difficult to choose between these alternatives; hut sup- 
pose the frequency values of the plaintext diphthongs as given in Table 6 
of Appendix 2 are added for each of these alternatives, giving the follow- 
ing: 

HI 0 sOAp, frequency valuer 7 HI c rOUp, frequency value =37 

SV c rUIp, frequency values 5 SV c =AIp, frequency value=17 

IS c =AUp, frequency value=13 IS c sUA p , frequency values 5 



Total 25 Total 59 

Mathematically, the second alternative appears to be more probable than 
the first .7 Let it be assumed to be correct and the following (still 
tentative) values are now at hand: 



2>c = Ep Hc=Op V c sl p S c =A p I G =Up 

b. Attention is now directed to the letters classified as conso- 
nants: How far is it possible to ascertain their values? The letter D c , 

from considerations of frequency alone, would seem to be Tp, but its 
frequency, 23, is not considerably greater than that- for T c . It is not 



7 A more accurate guide for choosing between the alternative groups 
of digraphs could be obtained through a consideration of the logarithmic 
weights of their assigned probabilities, rather than their plaintext 
frequency values. These weights are given in Appendix 2, along with an 
explanation of the method for their derivation; a detailed treatment of 
their application is presented in Military Cryptanalysis, Part II. 
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much greater than that for F c or G c , yith a frequency of 19 each, But 
perhaps it is possible to ascertain not the value of one letter alone hut 
of two letters at one stroke. To do this one may make use of a te bra- 
graph of considerable importance in English, viz ., TIOHp . For if the 
analysis pertaining to the vowels is corroct, and if VK^IOp, then an 
examination of the letters immediately before and after the ^digraph VH C 
in the cipher text might disclose both Tp and Ifp. Reference to the text 
gives the following: 

GVHT C FVBT C 

©I09p 0XO9p 

The letter T c follows VH C in both cases and very probably indicates that 
Tc^Npj but as to whether G c or F c equals Tp cannot be decided. However, 
two conclusions are clear: first, the letter D c is neither Tp nor Hp, 

from which it follows that it must be either Rp or Sp; second, the let- 
ters G c and F e must be either Tp and Sp, respectively, or Sp and T p , 
respectively, because the only tetragraphs usually found (in English) 
containing the diphthong I0p as central letters are SIOHp and TIOHp. 

This in turn means that as regards D c , the latter cannot be either Rp or 
Spj it must be Rp, a conclusion which is corroborated by the fact that 
ZD 0 (sERpJ and DZe (=REp) occur 9 times each. Thus far, then, the iden- 
tifications, when inserted in an enciphering alphabet, are "as follows : 

Plain ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher S Z V TH DGFI 

F G 

46. Substituting deduced values in the cryptogram . — a. Thus far the 
analysis has been almost purely hypothetical, for as yet not a single one 
of the values deduced from the foregoing analysis has been tried out in 
the cryptogram. It is high time that this be done, because the final 
test of the validity of the hypotheses, assumptions, and identifications 
made in any cryptographic study is, after all, only this: do these hy- 

potheses, assumptions, and identifications ultimately yield verifiable, 
intelligible plain text when consistently applied to the cipher text? 

b, At the present stage in the process, since there are at band the 
assumed values of but 9 out of the 25 letters that appear*, it is obvious 
that a continuous "reading" of the cryptogram can certainly not be expect- 
ed from a mere insertion of the values of the 9 letters. However, tbe 
substitution of these values should do tvo things . First, it should 
immediately disclose the fragments, outlines, or "skeletons" of "good" 
words in the text; and second, it should disclose no places in the text 
where "impossible" sequences of letters are established. By the first 
is meant that the partially deciphered text should show the outlines or 
skeletons of words such as may be expected to be found in the communi- 
cation; this will become quite clear in the next subparagraph. By the 
second is meant that sequences, such as "AOOEH" or "THRSEIIO" or the like, 
obviously not possible or extremely unusual in normal English text, must 
not result from the substitution of the tentative identifications result- 
ing from the analysis . The appearance of several such extremely unusual 
or impossible sequences would at once signify that one or more of the as- 
sumed values is incorrect. 
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c. Hero are the results of substituting the nine values which have 
been deduced by the reasoning based on a classification of the high- 
frequency letters into vowels and consonants and the study of the members 
of the two groups : 

1 2 3 4 8 6 7 8 9 10 U 12 13 14 1$ 18 17 18 19 20 21 22 23 24 23 

10 19 23 35 19 J>> 3 19 18 * 5 85 19 19 85 23 14 10 5 19 15 4 35 23 10 

A SFCZPZOGHLPZPGZDTSPFHBZDS 
ATRET SO ETSER A TO ERA 
S S T ST S' 

19 18 16 22 19 3 8 8 10 23 19 19 14 18 8 18 19 18 18 22 19 8 23 35 36 

B GVHTFUPLVDFGYVJVFVHTGADZZ 
SIO'NT IRTS I ITI0NS REE 

T S ST ST 

8 10 22 14 23 35 14 19 35 3 35 22 19 5 22 18 22 35 4 23 18 19 15 22 35 

C AITYDZYFZJZTGPTVTZBDVFHTZ 
N RE TE ENS NINE RITONE 

ST S 

23 19 8 10 4 19 10 23 35 14 15 22 8 3 10 14 18 22 3 19 18 2 19 36 35 

D D F X S B G I'D ZYVTXOIYVTEFVMGZZ 
RT A S RE IN IN TI SEE 

ST ST 

22 15 5 5 18 8 35 23 10 2 15 22 35 8 10 22 14 23 35 14 4 23 18 19 15 

E thlLvxzdfmhtzaitydzybdvfh 

NO I ERT ONE N RE RITO 
S S 

22 35 23 19 2 35 23 35 38 3 10 3 10 10 19 36 14 19 8 18 19 10 8 19 35 

F TZDFKZDZZJSXISGZYGAVFSLGZ 
NERT EREE A ASE S ITA SE 

S T T S T 

23 22 15 15 22 1 23 35 2 10 18 22 14 35 23 3 25 19 19 15 22 35 8 10 22 

G DTHHTCDZRSVTYZDOZFFHTZAIT 
RNOON RE AIN ER ETT0NE N 

S S 

14 23 35 14 19 8 18 23 19 35 36 22 2 15 10 22 14 35 14 10 23 35 10 15 8 

H YDZYGAVDGZZTKHITYZYSDZGHU 
RE S IRSEEN 0 N E ARESO 

T T T 

86 19 36 22 IS 5 6 10 23 10 8 1 8 8 1 8 10 2 5 35 23 19 5 10 23 

J ZFZTGUPGDIXWGHXASRUZDFUID 
ETENS SR SO A ERT R 

S T T T S 



8 19 15 22 18 3 8 10 8 8 

K EGHTVEAGXX 
S 0 N I ’ S 

T T 
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d. Ho impossible sequences ere brought to light, and, moreover, 
several long words, nearly complete, stand out in the text. Hote the 
following portions: 

HBZDS6VHTF 

(1) 07ERASI0NT 

T S 

CI5 

TVTZBDVFHTZDF 

(2) NINE7RIT0NERT 

S S 

F22 

SLGZDTHHT 

(3) A7SERN00N 

T 



The words are obviously OPERATIONS, NINE PRISONERS, and AFTERNOON. The 
value G c is clearly Tp; that of F c Is S p ; and the following additional 
values are certain: 

Bc-Pp L c =Fp 



kj. Completing the solution . — a. Each time an additional value is 
obtained, substitution is at once made throughout the cryptogram. This 
leads to the determination of further values, in an ever-widening circle, 
until nil the identifications ore firmly and finally established, and 
the message Is completely solved. In this case the decipherment is as 
follows : 

1 3 3 i S 6 7 8 9 10 11 12 13 J4 15 18 17 IS 19 20 21 23 23 24 25 



A 

B 

C 

D 



E 



F 

G 

II 

J 

K 



SFDZFIOGHLPZF 

ASRESULTOFYES 

GVHTFUPLVDFGY 

TIONSBYFIRSTD 

AITYDZYFZJZTG 

HUNDREDSEVENT 

DFXSBGIDZYVTX 

RSCAPTUREDINC 

THLLVXZDFMHTZ 

NOFFICERSXONE 

TZDFKZDZZJSXI 

NERSWEREEVACU 

DTHHTCDZRSVTY 

RN00NQREMAIND 

YDZYGAVDGZZTK 

DREDTHIRTEENW 

ZFZTGUPGDIXWG 

ESENTBYTRUCKT 

EGHTVEAGXX 

GT0N1GHTXX 



GZDYSPFHBZDS 

TERDAYSOPERA 

VJVFVHTGADZZ 

IVISIONTHREE 

PTVTZBDVFHTZ 

YNINEPRISONE 

OIYVTEFVMGZZ 
LUDINGS IXTEE 

AITYDZYBDVFH 

HUNDREDPRISO 

SGZYGAVFSLGZ 

ATEDTHISAFTE 

ZDOZFFHTZAIT 

ERLESSONEHUN 

HITYZYSDZGHU 

0UNDEDARET0B 

HXASRUZDFUID 

0CHAMBERS3UR 



oo 
✓ *— 
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Message: AS RESULT OF YESTERDAYS OPERATIONS BY FIRST DIVISION THREE 

HUNDRED SEVENTY HIKE PRISONERS CAPTURED INCLUDING SIXTEEN OFFICERS ONE 
HUNDRED PRISONERS HERE EVACUATED THIS AFTERNOON REMAINDER LESS ONE IHJN- 
DRED 'THIRTEEN WOUNDED ARE TO BE SENT BY TRUCK TO CHAIBERSBURG TONIGHT 

b , The solution should, as a rule, not "be considered complete until 
an attempt has "been mode to discover all the elements underlying the gen- 
eral system and the specific key to a message* In this case, there is no 
need to delve further into the general system, for it is merely one of 
uniliteral substitution with a mixed cipher alphabet. It is necessary 
or advisable, however, to reconstruct the cipher alphabet because thi3 
may give dues that later may become valuable . 

£, Cipher alphabets should, as a rule, be reconstructed by the 
cryptanalyst in the Torn of enciphering alphabets because they will then 
usually be in the form in which the encipherer used them. This is impor- 
tant for two reasons. First, if the sequence in the cipher component 
gives evidence of system in its construction or if it yields clues point- 
ing toward its derivation from a key word or a key phrase, this may often 
corroborate the identifications already made and may lead directly to 
additional identifications . A word or two of explanation is advisable 
here. For example, refer to the skeletonized enciphering alphabet given 
at the end of subpar. ^5b: 



Plain- 
Cipher - 



ABCDEFGHIJKLMNOPQRSTUVWXYZ 

S Z V TH DGFI 

F G 

Suppose the cryptanalyst, looking at the sequence DGFI or DFGI In the 
cipher component, suspects the presence qf a keyword-mixed alphabet. 

Then DFGI is certainly a more plausible sequence than DGFI. Examining 
the skeleton cipher component more carefully, he notes that S . . . Z 
would allow for insertion of three of the missing letters UWXY, since the 
letters T and V occur later, probably In the keyword itself j further, he 
notes that the key vord probably begins under Fp and ends in TH, making 
it probable that the TH is followed by AB or BC . This would mean that 
either P, Qp=A, B c or B, C c , Assuming that P, Qp=A, B c , he refers to the 
frequency distribution and finds that the assumptions Pp=A c and Qp=B c are 
not good; on the other hand, assuming that F, Qp=B, Cc, the frequency 
distribution gives excellent corroboration. A trial of these values 
would materially hasten solution because it is often the case in crypt- 
analysis that if the value of a very low-frequency letter can be surely 
established it will yield clues to other values very quickly. Thus, 
if Qp is definitely identified it almost invariably will identify Up, and 
will give clues to the letter following the Up, since it must be a vowel. 
In the case under discussion the identification P, QprB, C c would have 
turned out to be correct. For the foregoing reason an attempt should 
always be mode in the early stages of the analysis to determine, if 
possible, the basis of construction or derivation of the cipher alphabet; 
as a rule this can be done only by means of the enciphering alphabet, and 
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not the deciphering alphabet. For example, the skeletonized deciphering 
alphabet corresponding to the enciphering alphabet directly above is as 
follows : 



Cipher ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Plain RTSOU AN 1 E 

S T 



Here no evidences of a keyword -mixed alphabet are seen at all. However, 
if the enciphering alphabet has been examined and shows no evidences of 
systematic construction, the deciphering alphabet should then be examined 
with this In view, because occasionally it is the deciphering alphabet 
which shows the presence of a key or keying element, or which has been 
s ys temat ic allj r derived from a word or phrase. The second reason why it is 
important to try to discover the basis of construction or derivation of 
the cipher alphabet is that it affords clues to the general type of key 
words or keying elements employed hy the enemy. This is a psychological 
factor, of course, and may be of assistance in subsequent studies of his 
traffic. It merely gives a clue to the general type of thinking Indulged 
in hy certain of his cryptographers . 

d. In the case of the foregoing solution, the complete enciphering 
alphabet is found to be as follows: 



Plain ABCDEFGHIJKLMNOPQR’STUVWXYZ 

Cipher SUXYZ LEAVE WORT II BCDFGIJKMP 



Obviously, the letter Q, which is the only letter not appearing in the 
cryptogram, should follow P in the cipher component. Hobo now that the 
latter is based upon the keyword IEAVEHWORTH, and that this particular 
cipher alphabet has been composed by shifting the mixed sequence based 
upon this keyword five Intervals to the right so that the key for the 
message is Ap-S c Note also that the deciphering alphabet fails to give 
any evidence of keyword construction based upon the word LEAVENWORTH. 



Cipher ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Plain HPQRGSTOUVWFXJLIZMANBIKCDE 



<j. If neither the enciphering nor the deciphering alphabet exhibits 
characteristics which give indication of derivation from a key irord hy 
some form of mixing or disarrangement, the use of such a key word for 
this purpose is nevertheless not finally excluded as a possibility. For 
the reconstruction of such mixed alphabets the cryptanalyst must use 
ingenuity and a knowledge of the more common methods of suppressing the 
appearance of key words in the mixed alphabets . Several of these methods 
are given detailed treatment in par. 51 below. 

f . It is very important in practical crypt analytic work to pre- 
pare a technical summary of the solution of a system. Step-by-step 

TJ 

u It Is usual practice to employ as the specific key the equivalent 
of eithei’ Ad, or the equivalent of the first letter of the plain compo- 
nent when tlais component is a mixed sequence. 
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commentaries should accompany an initial solution; the steps taken should 
he jotted down as they are made , and at the end they should he confined 
into a complete"r?s'um£ "of the analysis. The r£sumS should he brief and 
concise, yet comprehensive enough that at any future time the solution 
may he reconstructed following the exact manner in which it was origi- 
nally accomplished. Assumptions of words, etc., should he referred to 
with work sheet line- and column-indicators, and should he couched in the 
proper cryptologic language or symbols. A short exposition of the mech- 
anics of the general system, enciphering alphabets, enciphering diagrams, 
etc., as well as all key words (together with their derivation) and spe- 
cific keys should he included. On the work sheet there should he a 
letter-for-letter decryptment under the cipher text; the final plaintext 
version should be in word lengths, with any errors or garbles corrected. 
Hulls or "indicators showing sentence separation, change of key, etc., may 
he enclosed in parentheses . All work sheets and notes should he kept 
together with the solution. 

48. General remarks on the foregoing solution . — a. The example 
solved above is admittedly a more or less artificial illustration of the 
steps in analysis, made so in order to demonstrate general principles. 

It was easy to solve because the frequencies of the various cipher let- 
ters corresponded quite well with the normal or expected frequencies. 
However, all cryptograms of the same monoalphabetical nature can he 
solved along the same general lines, after a certain amount of experi- 
mentation, depending upon -the length of the cryptogram, and the skill 
and experience of the cryptanalyst .9 

h. It is no cause for discouragement if the'-student 1 s initial 
attempts to solve a cryptogram of this type require much more time and 
effort than were apparently required in solving the foregoing purely 
illustrative example. It is indeed rarely the case that every assumption 
made by the cryptanalyst proves in the end to have been correct; more 
often it is the case that a good many of his initial assumptions are In- 
correct, and that he loses much time in casting out the erroneous ones. 
The speed and facility with which this elimination process is conducted 
is in many cases all that distinguishes the expert from the novice. 

r ' ~ 'o 

The use of monoalphahetic substitution in modern, military operations 
is exceedingly rare because of the simplicity of solution. However, Buch 
cases have occurred, and one rather illuminating instance may he cited. 

In an Important communication on 5 August 1918, General Kress von 
Kressenstein used a single mixed alphabet, and the intercepted radio mes- 
sage was solved at American GHQ very speedily. A day later another mes- 
sage, but In. a very much more difficult cipher system, was Intercepted 
and solved. When translated, it read as follows: 

"GHQ Kress : 

The cipher prepared by General von Kress was at once solved here. 

Its further use and employment is forbidden. 

Chief Signal Officer, Berlin.” 
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q. Nor •will the student always find that the initial classification 
into vowels and consonants can he accomplished as easily and quickly as 
was apparently the case in the illustrative example. The principles In- 
dicated are very general in their nature and applicability, and there are, 
in addition, some other principles that may he brought to hear in case of 
difficulty. Of these, perhaps the most useful are the following: 

(1) In normal English it is unusual to find more than two consonants 
in succession, each of high frequency. If in a cryptogram a succession 
of three or four letters of high-frequency appear in succession, it is 
practically' certain that at' least one of these represents a vowel. 

(2) Successions of three vowels are rather unusual in English. •*- 3 - 
Practically the only time bhis happens is when a word ends in two vowels 
and the next word begins with a vowel, 12 

(3) When two letters already classified as vowel-equivalents are 
separated by a sequence of six or more letters, it is either the case 
that one of the supposed vowel-equivalents is incorrect, or else that 
one or more of the intermediate letters is a vowel-equivalent.^ 

(4) Reference to Table 7-B of Appendix 2 discloses the following: 

Distribution of first 18 digraphs forming 25 percent of English text 

Number of consonant-consonant digraphs k 

Number of consonant-vowel digraphs -r — 6 

Number of vowel-consonant digraphs — 8 

Number of vowel-vowel digraphs 0 

Distribution of first 53 digraphs forming gO percent of English text 

Number of consonant-consonant digraphs < 8 

Number of consonant -vowel digraphs — 23 

Number of vowel -consonant digraphs — l8 

Number of vowel-vowel digraphs' 4 



1( ^ Sequences of seven consonants are not impossible, however, as in 
STR ENGTH THRO UGH. 

11 Note that the word RA DIOED , past tense of the verb RADIO, is coming 
into usage. 

12 A sequence of seven vowels is not impossible, however, as in THE 
WAY YOU EARN. 

13 

Some cryptanalysts place a good deal of emphasis upon this principle as a method of locating the remaining 
vowels after tho first two or three have been located. They recommend that the latter bo underlined throughout 
tho toxt and then all sequences of five or more letters showing no underlines be studied attentively. Certain 
letters which occur in several such sequences are sure to be vowels. An arithmetical aid in the study is as follows*. 
Take a letter thought to be a good possibility as the cipher equivalent of a vowel (hereafter termed a possible 
vowel-equivalent) and find the length of each interval from the possible vowel-equivalent to the next known (fairly 
surely determined) vowel-equivalent. Multiply the interval by the number of times this interval Is found. Add 
the products and divide by the total number of intervals considered. This will give the mean interval for that 
possiblo vowel-equivalent. Do the same for all the other possible vowel-equivalents. The one for which the 
mean is the greatest is most probably a vow’el-equivalent. Underline this letter throughout the text and repeat 
the process for locating additional vowel-equivalents, if any remain to be located. 
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The lattei’ tabulation shows that of the first 53 digraphs which form 50 
percent of English text, 4l of them, that is, over 75 percent, are qpm- 
binations of a rowel with a consonant. In short, in normal English the 
vpwels and the high-frequency consonants are in the long run distributed 
fairly evenly and regularly throughout the text. 



(5) As a rule, repetitions of trigraphs in the cipher text are com- 
posed of high-frequency letters forming high-frequency combinations. The 
latter practically always contain at least one vowel; in fact, if refer- 
ence "is made to Table 10-A of Appendix 2 it will be noted that 3& of the 
5 6 trigraphs having a frequency of 100 or more contain one vowel, 17 of 
them contain two vowels 3 and only three of them contain no vowel. In the 
case of tetragraph repetitions, Table 11-A of Appendix 2 shows that no 
tetragraph listed therein fails to contain at least one vowel; 27 of them 
contain one vowel, 25 contain two vowels , and 2 contain three vowels , 



(6) Quite frequently when two known vowel-equivalents are separated 
^hy six or more letters none of which seems to be of sufficiently high 
frequency to represent one of the vowels A E I 0, the chances are good 
that the cipher-equivalent of the vowel U or Y is present. 

d. To recapitulate .the general principles, vowels may then be dis- 
tinguished from consonants in that they are usually represented by: 



( 1 ) 

( 2 ) 

(3) 

,(4) 

letters 



high-frequency Tetters; ' - 

high-frequency letters which do not readily contact each other; 
high-frequency letters which have a great variety of contact; 

i ' I 

high-frequency letters which have an affinity for low-frequency 
(i.e ' • ’ ! 






low -frequency plaintext consonants) , 

e. In the foregoing example the amount of experimentation or ” cut- 
ting and fitting” was practically nil, (This is not true of real cases 
as a rule,) Where such experimentation is necessary, the underscoring of 
all repetitions of several letters is very essential, as it calls attent- 
ion to peculiarities of structure that often yield clues . 



f . After a few basic assumptions of values have been made, if short 
words or skeletons of words do not become manifest, it is necessary to 
make further assumptions for unidentified letters. This is accomplished 
most often by assuming a word.^ iJow there are two places in every mes- 
sage which lend themselves more readily to successful attack by the assump- 
tion of words than do any other places --the very beginning and the very 
end of the message. The reason is quite obvious, for although words may 
begin or end with almost any letter of the alphabet, they usually begin 



p. 

_ _ ‘This process does not involve anything more mysterious than ordinary, logical reasoning; there is nothing 
of the subnormal or supernormal about it. If cryptanalytic success seems to require processes akin to those of 
medieval magic, if "hocus-pocus” is much to the fore, the student should begin to look for items that the claimant 
of such success has carefully hidden from view, for the mystification of the uninitiated. Tf the student were to 
adopt, as his personal motto for all his cryptanalytic ventures the quotation (from Tennyson’s poem Cohimbas) 
appearing on the back of the title page of this text, he will frequently find “short cuts” to his destination and will 
not too often be led astray! 
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and end with but a few v§fy coalition digraphs and trigraplis . Very often 
the association of letters in peculiar combinations will enable the stud- 
ent to note where one word ends and the next begins. For example suppose, 
E, N, S, and T have been definitely identified, and a sequence like the 
following Is found in a cryptogram: 

. ..EETSHE... 

Obviously the break between two words should fall either after the S of 
E H 1 S or after the T of E IT T, so that two possibilities are offered: 

. . , BUIS /HE . . or . . . E N T / S N E . . .. Since in 
English there are very few words with the initial trigraph S N E, it is 
most likely that the proper division ia«. •ENTS/HE. ... Of 
course, when several word divisions have been found, the solution is 
more readily achieved because of the greater ease with which assumptions 
of additional new values may be made . 

g. Although a considerable amount of detailed treatment has been 
devoted to vowel-consonant analysis, it is felt advisable again to caution 
the student against the natural tendency to accept without question the 
results of any one cryptanalytic technique exclusively, even one such as 
vowel -consonant analysis which seems quite scientific in character. 

*1-9. The "probable -word" method? its value and applicability .— a. In 
practically all cryptanalytic studies, short cuts can often be made~by 
assuming the presence of certain words in the message under study. Some 
writers attach so much value to this kind of an "attack from the rear" 
that they practically elevate it to the position of a method and call it 
the "intuitive method" or the "probable-word method." It is, of course, 
merely a refinement of what in everyday language is called "assuming" or 
"guessing" a word in the message. The value of making a "good guess" can 
hardly be overestimated, and the cryptanalyst should never feel that he 
is accomplishing a solution by an illegitimate subterfuge when he has 
made a fortunate guess leading to solution. A correct assumption as to 
plain text will often save hours or days of labor, and sometimes there 
is no alternative but to try to "guess a word", for occasionally a system 
is encountered the solution of which is absolutely dependent upon this 
artifice . 

b. The expression "good guess" is used advisedly. For it is "good" 
in two respects . First, the cryptanalyst must use care in making his 
assumptions os to plaintext words . In this he must be guided by extra- 
neous circumstances leading to the assumption of probable words — not just 
any words that coma to his mind. Therefore he must usc’h'is imagination 
but he must nevertheless carefully control it by the exercise of good 
judgement. Second, only if the "guess" is correct and leads to solution, 
or at least puts him on the road to solution, it is a good Guess, But, 
while realizing the usefulness and the time and labor-saving features of 
a solution by assuming ’a probable word, the cryptanalyst should exercise 
discretion in regard to how long he may continue in his efforts with this 
method. Sometimes he may actually waste time by adhering to the method 
too long, if straightforward, methodical analysis -\rill yield results more 
quickly. > 
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_o. Obviously, the "probable -word' 1 method has much more applicability 
when working upon material the General nature of which is known, than when 
working upon more or loco isolated comraunioabionn exchanged between corres- 
pondents concerning whon or whose activities nothing is known. For in the 
latter case there io little or nothing that the imagination can seize upon 
as a background or bos is for the assumptions However, in the ease of 
military cryptanalysis Ln time of active operations there is, indeed, so 
great a probability that certain words and expressions ore present in cer- 
tain cryptograms that those words and expressions ("cliches") are often 
referred to as "cribs" (as defined in Webster's New Collegiate Diction- 
ary: "...a plagiarism; hence, a translation, etc,, to aid a student in 

reciting."). The cryptanalyst is quite sure they ere present in the 
cryptogram under examination— wliat he must do is to "fit' the crib to 
the text", that is, locate it ln the cipher text. 

-d. Very frequently, the choice of probable words is aided or limit- 
ed by" the number and positions of repeated letters . These repetitions 
may be patent — that is, externally visible in the cryptographic text as 
it originally stands — or they may be latent — that is, externally invis- 
ible but susceptible of being made patent as a result of the analysis. 

For example, in a monoalphabetic substitution cipher, such as that dis- 
cussed in the preceding paragraph, the repeated letters are directly 
exhibited in the cryptogram; later the student will encounter many cases 
in which the repetitions are latent, but are made patent by the analyt- 
ical process. When the repetitions are patent, then the pattern or for- 
mula to which the repeated letters conform is of direct use in assuming 
plaintext words; and when the text is in word-lengths, the pattern is 
obviously of even greater assistance. Suppose the cryptanalyst is deal- 
ing with military text,. In which case he may expect such words as DIVIS- 
ION, BATTALION, etc., to be present in the text. The positions of the 
repeated letter I in DIVISION, of the reversible digraph AT, TA in BAT- 
TALION, and so on, constitute for the experienced cryptanalyst tell-tale 
indications of the presence of these words, even when the text is not 
divided up into its original word lengths. 

e. The important aid that a study of word patterns can afford in 
cryptanalysis warrants the use of definite terminology and the establish- 
ment of certain data having a bearing thereon. The phenomenon herein 
under discussion, namely, that many words are of such construction as 
regards the number and positions of repeated letters as to make them 
readily identifiable, will be termed idiomorphism (from the Greek "idios"a 
one’s own, individual, peculiar 4* "morphe’ , =form) Words which show this 
phenomenon will be termed idj omorphlc . It will be useful to deal with 
the ldiomorphisms symbolically and systematically as described below. 



. — 

•General GIvlerge in his Conn de Ci yplographie (p 121) says. "However, expert cryptanalysts often 
employ Buch details as are cited above [in connection with assuming the presence of ‘probable words’], and the 
experience of the years 1914 to 1918, to cite only those, prove that in practice one often has at his disposal ele- 
ments of this nature, permltti lg assumptions much moro audacious than those which served for the analysis 
of the last example. The reader would therefore be wrong ln imagining that such fortuitous cloments are 
encountered only in cryptographic works where the author deciphers a document that ho himself enciphered. 
Cryptographs correspondence, if it is extensive, and if sufficiently numerous working data are at hand, often 
furnishes elements so complete that an author would not dare usb all of them in solving a problem for fear of 
being accused of obvious exaggeration." 
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f . When dealing vith cryptograms in which the word lengths are 
determined or specifically shown, it is convenient to indicate their 
lengths and their repeated letters in some easily recognized manner or 
hy formulas. This is exemplified, in the case of the word DIVISION, by 
the formula ABCBDBEF ; in the case of the word BATTALION, "by’ the formula 
ABCCBDEFG . If the cryptanalyst, during the course of his studies, makes 
note of striking formulas he has encountered, with the words which fit 
them, after some time he will have assembled a quite valuable body of 
data. And after more or less complete lists of such formulas have been 
established in some systematic arrangement, a rapid comparison of the 
idiomorphs in a specific cryptogram with those in his lists will be feas- 
ible and will often lead to the assumption of the current word. Such 
lists can be arranged according to word length, as shown herewith: 



3/aba 


DID, EVE, EYE, etc. 


abb 


ADD, ALL, ILL, OFF, etc. 


4/abac 


ARAB, AWAY, etc. 


abbc 


ALLY, BEEN, etc. 


abca 


AREA, BOMB, DEAD, etc. 


abcb 


ANON, CEDE, etc. 


etc. 


etc. 



_g. When dealing with cryptographic text in which the lengths of the 
words are not indicated or otherwise determinable, lists of the foregoing 
nature are not so useful as lists in which the words (or parts of words) 
ere arranged according to the intervals between identical letters, in the 
following manner: 



1 Interval 
-DiD- 
-EvE- 
-EyE- 
dlvlsion 
revision 
etc. 



2 Intervals 
AbbAcy 
ArAbiA 
AblAtive 
AboArd 
-AciA- 
etc. 



3 Intervals 
AbeyAnce 
hAbitAble 
lAborAtory 
AbreAst 
AbroAd 
etc . 



Repeated digraphs 
COCOa 
-dERER 
ICICle 
-Mg 
bAGgAGe 
etc. 



h. The most usual practice, however, in designating idiomorphic 
patterns and classifying them into systematic lists is to assign a lit- 
eral nomenclature to that portion of a word (or sequence of plaintext 
letters) which contains the distinctive pattern, beginning with the first 
letter which is repeated in the pattern and ending with the last letter 
which is repeated in the pattern. Thus, the word DIVISION would be term- 
ed as an idiomorph of the abaca class (based on the sequence TVTST con- 
tained therein), and t he wo rd BATTALION as an idiomorph of the abba class 
(based on the sequence ATTA) . In Appendix 3 will be found a compendium 
of the more frequent military words in English, arranged according to 
word-lengths in alphabetical order and in rhyming order; in addition, 
there will be found in this appendix a listing of idiomorphs arranged 
first according to pattern and then according to the first letter of the 
idiomorphic sequence. 
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50 < Solution of additional cryptograms produced by the same compon - 
ents^ -'-a. To' return/ after a rather long digression, to the oryptogram 
solved in pars . 44 - 47, once the components of a cipher alphabet have 
been reconstructed, subsequent messages which have been enciphered by 
means of the same components may be solved very readily, and without re- 
course to the principles of frequency, or application of the probable- 
word method. It has been seen that the illustrative cryptogram treated 
in paragraphs 4l - 47 weis enciphered by Juxtaposing the cipher component 
against the normal sequence so that ApsS c , It is obvious that the cipher 
component may be set against the plain component at any one of 2 6 differ- 
ent points of coincidence, each yielding a different cipher alphabet. 
After the components have been reconstructed, however, they become 
known sequences and the method of converting the cipher letters into 
their plain-component equivalents and then completing the plain-component 
sequence 1 ^ begun by each equivalent can be applied to BOlve any crypto- 
gram which has been enciphered by these components. 

b. An example will serve to make the process clear. Suppose the 
following message, passing between the same two stations as before, was 
intercepted shortly after the first message had been solved: 

IYEWK CERITW OFOSE LFOOH EAZXX 

It is assumed that the same components were used, but with a different 
key letter. First the initial two groups are converted into their 
plain-component equivalents by setting the cipher component against the 
plain component at -any arbitrary point of coincidence. The initial let- 
ter of the former may as well be set against A of the latter, with the 
following result : 

Plain ABCDEFGHIJKLMNOPQRSTUVWXYZ 

qipher LEAVNWORTHBCDFGIJKMPQSUXYZ 

"‘-Cryptogram-—- IYEWK CERNW 

Equivalents— PYBFR LBHEF ... 

The plain component sequence initiated by each of these conversion equiv- 
alents is now completed, with the results shown in Fig. 15. Rote the 
plaintext generatrix, CLOSEYOURS, which manifests itself without further 
analysis. The rest of the message may be read either by continuing the 
same process, or, what is even more simple, the key letter of the message 
may now be determined quite readily and the message deciphered by its 
means. 



° It must be noted that If the plain component is a mixed sequence % 
then it is this mixed sequence which must be used to complete the 
columns , 
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IYEWKCERNW 
PYBFRLBHEF 
QZCGSMCIFG 
RADHTNDJGH 
SBEIUOEKHI 
TCFJVPFLIJ 
UDGKVF QGMJK 
VEHLXRHNKL 
WFIMYS IOLM 
XGJIZTJPMN 
YHKOAUKQNO 
ZILPBVLROP 
AJMQCWMSPQ 
BKNRDXNTQR 
*C LOSEYOUES ' 

DMPTFZPVST 
ENQUGAQWTU 
FORVHBRXUV 
GPSWICSYVW 
HQTXJDTZWX 
IRUYKEUAXY 
JSVZLFVBYZ 
KTWAMGWCZA 
LUXE NHXDAB 
MVYCOIYEBC 
BTWZDPJZFCD 
OXAE QKAGDE 

Figure 15. 

c. In order that the student may understand without question just 
what is involved in the latter step, that is, discovering the key letter 
after the first two or three groups have Been deciphered "by the conver- 
sion-completion process, the foregoing example will he used. It was 
noted that the first cipher group was finally deciphered as follows: 

Cipher- - I Y E W K 

Plain CLOSE 

Wow set the cipher component against the normal sequence so that Cp=I c , 
Thus : 



Plain ABCDEFGHIJKLMNOPQRSTUVWXYZ 

Cipher FGIJKMPQSUXYZLEAVNWORTHBCD 



It is seen here that when Cp=I c then Ap=F c , This is the key for the en- 
tire message. The decipherment may he eong>leted by direct reference to 
the cipher alphabet. Thus: 

Cipher— IYEWK CERWW OFOSE LFOOH EAZXX 

Plain— CLOSE YOURS TATIO WATTW OPM XX 

Message: CLOSE YOUR STATION AT TOO PM 
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d. The student should make sure that he understands the fundamental 
principles involved in this quick solution, for they are among the most 
important principles in cryptanalytics , How useful they are will hecorae 
clear as he progresses into more and more complex cryptanalyfcic studies, 

e. It must he kept in mind that there are four ways' 1 " that two basic 
sequences may he used to form a cipher alphabet, subject *to the instruc- 
tions guiding the cryptographer in the use of his cryptosystem; this fact 
must be considered when additional cryptograms appear in a particular 
cryptosystem for which the primary components have been recovered. Assum- 
ing that the sequences just recovered are labelled "A" and "B M , then the 
following contingencies might arise in the encryption of subsequent mes- 
sages: 

(1) "A" direct for the plain component, and "B" direct for the 
cipher component (as in the original recovery); 

(2) "A" direct for the plain, and "B" reversed for the cipher; 

(3) ”B" direct for the plain, and "A" direct for the cipher; and 

(t) "B" direct for the plain, and "A" reversed for the cipher* 

51, Derivation of key words .— a. Concurrent with the solution of a 

cryptogram, there should be a simultaneous effort in the reconstruction 
of cipher alphabets and recovery of key words , Much labor can thus be 
saved as "recovery of the keys early in the stages of solution may trans- 
form the process of cryptanalysis into one of decipherment, 

b. A mixed cipher alphabet falls into one of five categories, 
according to the composition of its components, viz ,, 

(1) the plain component is the normal sequence and the cipher com- 
ponent is mixed; 

(2) the cipher component is the normal sequence and the plain com- 
ponent is mixed; 

(3) both components are the same mixed sequence; 

(t) both components are the same mixed sequence, but running in 
reverse; or 

(5) the components are different mixed sequences. 

c. Let us examine several types of mixed sequences, using the key 
word HYDRAULIC as an example. The ordinary keyword-mixed sequence pro- 
duced from this key word is; 

(1) HYDRAULICBEFGJKMHQPQSTVWXZ 
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The two principal transposition-mixed types based on this key word are 
derived from the diagrams 



HYDRAULIC 
BEFGJKMNG ’ 

PQSTVWXZ and read: 

(2) HBPYEQDFSRGTAJVUKWLMXINZCO and 

(3) AJVCODFSHBPINZLMXRGTUKWYEQ 

Other types may arise from various types of route transpositions such as 



1 following, 


using 


the 


foregoing 


diagram: 


» 


















(4) 


H 


B 


P 


Q 


E 


Y 


D 


F 


S 


T 


G 


R 


A 


J 


V 


■W 


K 


U 


L 


M 


X 


z 


N 


I 


C 


0 


(5) 


H 


Y 


B 


P 


E 


D' 


R 


F 


Q 


S 


G 


A 


U 


J 


T 


V 


K 


L 


I 


M 


w 


X 


N 


C 


0 


z 


(6) 


P 


B 


Q 


H 


E 


S 


Y 


F 


T 


D 


G 


V 


R 


J 


W 


A 


K 


X 


U 


M 


z 


L 


N 


I 


0 


c 


(7) 


H 


Y 


D 


R 


A 


U 


L 


I 


C 


0 


N 


M 


K 


J 


G 


F 


E 


B 


P 


Q 


s 


T 


V 


W 


X 


z 


(8) 
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C 
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P 
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S 


T 
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E 


(9) 
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M 
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(10) 
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P 
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Q 
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E 


H 


U 


M 
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F 


Y 


A 


K 
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V 


G 


D 


R 


J 


w 



Any transposition system may he employed to produce a systematically- 
mixed sequence; practicability of method is the only determining factor. 

It must be remembered that the greatest amount of systematic mixing will 
produce a sequence inherently no more secure than a random-mixed alphabet, 

d, The student would do well to construct both enciphering and de- 
ciphering versions of cipher alphabets recovered, as has been previously 
mentioned. For example, in the following case 

Plain: JQNMFHLEBRSKGYZOT ICDUVAWPX 

Cipher: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

no semblance of a key is apparent; but in the inverse form 

Plain: ABCDEFGHIJKLMUOPQRSTUVWXYZ 

Cipher: WISTHEMFRALGDCPYBJKQUVXZNO 

the key-phrase "ROW IS THE TIME FOR ALL GOOD MEN TO COME TO THE AID OF 
THEIR PARTY" is quite clear. In other types of mixed sequences, first 
the one form is attacked, and then if negative results are obtained the 
inverse form is treated, 

e , Let us consider the following cipher alphabet: 

P: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

C: DWZMSOCRYATXBEFUGQHIVJKLNP 

The section V W X seems to comprise superimposed parts of the non-keyword 
J K L 

portions of mixed sequences. Adding Y Z to the plain component, we get 
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V W X Y Z which is certainly consistent as far as alphabetical progres- 
JKLKP 

sion goes, and indicates that the letters M and 0 are present in the key 
word of the cipher component. Continuing in this vein, the section 
MNOQSTVWXYZis rapidly established by correlating both se- 
BEFQHI JKLSfP 

quences . It is obvious that the plain component key word begins right 
after the Z, and that the cipher component key word probably just pre- 
cedes the B. Going to the right, Z R H suggests key words like RHOMBOID, 

PQR • 

RHEUMATISM, etc. These trials are quickly repudiated; therefore we go on 
to Z R E which is acceptable. Z R E K is found wanting, but Z R E P is 
PQS PQST PQSU 

very satisfactory, and this is soon expanded toZREPUBLIC, and in 

FQSUVWXYZ 

a moment or two we recover the complete cipher alphabet: 

P: REPUBLIC AUDFGHJKMOQSTVWXYZ 

C: QSUVWXYZDEMOCRATBFGHIJKLNP 

f . In the example below the student will observe that the alphabets 

are reciprocal: this is an indication of identical sequences at a shift 

of 13 , or that a mixed sequence running against itself in reverse has 
been employed. In this case the W X Y Z points to the latter hypothesis. 

Z Y X W 

P: ABCDE'FGHIJKLMNOPQRSTUVWXYZ 

0: HO JFTDRAKC IMLGBSUVPEQRZYXW 

Starting with the V W X Y Z R cluster, we see that the key word begins 
R Z Y X W V 

with the letter Rj therefore the next letter should be a vowel. Z R A 

W V H 

is not acceptable, but Z R E is fine, shewing that the letter U appears 

W V T 

in the key word. Continuing the same line of reasoning as in the preced- 
ing example, and with a little further experimentation, the final alpha- 
bet is discovered to be 

P: REPUBLICANDFGHJKMOQSTVWXYZ 

C: VTSQOMKJHGFDUAC ILBUPERZYXW 

g. In the next example, all efforts to derive key words on the 

basis~of keyword-mixed sequences are fruitless: the conclusion is there- 

fore drawn that this is a case of a transposition. 

P: ABCDEFGHIJKLMNOPQRSTUVWXYZ 

- C: ACSEJYIGWLFVMHXNKZPBQRDUTO 

Considering the mechanics of the cryptography involved, and assuming for 
the time being that Z is at the bottom of the matrix and not in the key 
word, we start with the: letters to the left, or if this fails, to the 
right of Z in the cipher component, obtaining the column N which is not 
" K 

Z 

incompatible if N is in the key word on the top row. If we place Y to 
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the left of Z and 'build up its column, we get E N which is excellent# 

J K 
Y Z 

This Is expanded into I ME M which quickly "becomes 7^18435269 

GHJK PATlTm E N T 

W X Y Z BODFGItJKO 

QSUVWXYZ 

This last example was very easy "because none of the letters V W X Y Z ap- 
peared in the key word; "but other cases should hardly prove more difficult* 

h, Two additional methods that have been encountered for deriving 
mixed” sequences maybe mentioned. One is a slight modification of the 
preceding paragraph, when the key word contains repeated letters: 

18734 9 * 526 

C 0 M . I T . E . 

ABDFGHJKL 

RPQRSUVWX 

Y Z which produces the mixed sequence: 

CARYEKWFRIGSJVLXMDQOBPZTHU 

The other method 1 b an interrupted-key columnar transposition system: ^ 

5 1 3 4 2 6 

V A L . E Y 
B C) 

B F G H I) 

J K M) 

N 0 F Q) 

R) 

S T U W X Z) which produces the mixed sequence: 

ACFKOTE IXLGMPUHQWVBDJRRSYZ 

The first example will succumb to the treatment outlined in subpar, g, 
whereas the second method is vulnerable owing to the presence of the 
fragments D J H, F K 0, and G M P in the sequence which may he anagram- 
med, Rote the fair-sized fragment BE JOS, composed of an ascending 
sequence of letters; this is an outward manifestation of the interrupted- 
key columnar method, 

i« There are still other methods used for the production of mixed 
sequences, but space does not permit giving further examples. However, 
the student should by this time he able to devise methods of attack for 
any special cases that may present themselves, based upon the crypt- 
analytically exploitable weaknesses or peculiarities inherent in the 
system of cryptography involved. 



17 

It is to be noted that in this particular case the numerical key 
serves two purposes: (l) determining the cut-off point (and therefore 
the number of letters) in each row of the diagram, after the appearance 
of the keyword; and (2) determining the order of transcription of the 
columns , 
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SECTION VII 

, .MULTILITERAL SUBSTITUTION WITH SINGLE -EQUIVALENT CIPHER ALPHABETS 

. Paragraph 

General types of multiliteral cipher alphabets . „ 52 

The Baconian and Trithemlan ciphers ......... 53 

Analysis of raultiliteraL, mono alphab et i c ~ subst itut ion ciphers ....... 5^- 

Historically Interesting examples 55 

The international (Baudot) teleprinter code........... §6 



• - -5 2. General types of multil i teral c ip her a lphabets.— a. 

betic substitution methods in general may be classified into 
and multilateral systems . In the former there is a strict '’one-to-ope* 
correspondence between the length of" the units of the plain and those of 
the cipher text; that is, each letter of the plain text is replaced by a 
single character in the cipher text. In the latter this correspondence 
is no longer lptlc but may' be lp:2 e > where each letter of the plaintext 
Is replaced by a combination of two characters in the cipher text; or 
lpj3 c » where a three -character combination in the cipher text represents 
a single letter of the plain text, and so on. A. cipher in which the cor- 
respondence is of the lp!l c type is termed uniliteral - in character; one 
in which it is of the l p :2 0 type, biliteral ; lp:3 c , tri literal , and so on. 
Ciphers in which one plaintext letter is represented by cipher characters 
of two or more elements are classed as multiliteral. 1 - ■ - 

b. Biliteral alphabets are usually composed of a set of 25 or 2 6 
pombinations of a limited number of characters taken in pairs. An 
pxanrple of such an alphabet is the following: 



Plain- 
Cipher - 



Plain- 
Cipher - 



ABC'DEFGHIJKLM 
WW \JE WI WT WE HW " HH HI ' HT HT HE IW IH 

_!n '(T P“ ' q r~ § t’u v w Y y z 

II PE* IE TW TH TI TT TE EW EH El ET EE 



This alphabet is derived from the cipher" square or matrix shown in 
Fig. 1 6 . The cipher equivalent of each plaintext element is made up of 
two coordinate letters from outside the cipher matrix, one letter being 
the coordinate of the row, the other being the coordinate of the column 

1 -The terms unilateral and multiliteral, although originally applied 
only to cipher text composed of letters, are used here in their broader 
sense to embrace cipher text In letters, digits, and even other symbols . 
In more precise terminology, these terms would probably be monosymbolic 
and polys ymbo lie , respectively, but the terms uniliteral and multiliteral 
are too well established in literature to be changed at this late time. 
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In which the plaintext letter is located. In other words, the letters at 
the side and top of the matrix have "been used to designate, according to 

■ C8) 



W 

H 

(1) I 

T 

E 



- ■ - Figure 1 6 . 

a coordinate system, the cell occupied by each letter within the matrix. 
The letters (or figures) constituting the coordinate elements of such 
matrices are termed row and column indicators . - v ' 

o. If a message is enciphered by means of the_foregoing biliteral 
alphabet, the cryptogram is still monoalphabetic in character. A fre.-. 
quency distribution based upon pairs of letters will obviously have all 
the characteristics of a simple, unillteral distribution for .a monoalpha- 
betic substitution .cipher. - 

d. The cipher alphabets shown thus far in this text have involved 

only letters - ,' hut alphabets in which the cipher component consists of 
figures, or groups of figures, are not uncommon in military cryptography ?• 
Since there are hut 10 digits it is obvious that, in order to represent 
an alphabet of more than 10 characters by means of figure ciphers, combi- 
nations of at least two digits are necessary. The simplest kind of such 
an alphabet is that in which 4p=01, Bp=02, . . . Zp=26; that is, one in 
which the plaintext letters have_as their equivalents two-digit numbers 
Indicating their positions _in the normal alphabet. . . - 

e. Instead of a simple alphabet of the preceding type, it is pos- 
sible - " to use a diagram of the type shown in Fig, IT. In this cipher ‘ . 
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Figure 17. 



P 

Although, as an extension of this idea, cipher alphabets employing 
signs and symbols are possible, such alphabets are not suitable for 
modern cryptography because they can be neither telegraphed nor "tele- 
phoned with any degree of accuracy, speed, or facility. 
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tine letter A p is represented "by the dinomo ^ 11, B p "by the dinome 12, 
ete'. Furthermore, this matrix includes provision for the encipherment of 
some of the frequent lyWused punctuation marks in addition to the 2 6 let- 
ters. ; - •• •-- ■- 

f . Other types of "biilteral cipher alphabets are illustrated in the 



examples below: 
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. Figure 19 . 












-Figure lo. 








- 














- 








“ * 


— 




... . 






’ 






“ 






' 




“ - 


- 


- 


- 


— 


« . 




_ 














- 


M 


‘TJ 


N 


I 


C 


H 




A 


B 


C D 


E 


F 


7 T 


H 


I 


B 


G 


7 


E 


5 


R 


M 


A 


A 


D 


G J 


M 


P 


S 


V 


Y 


i' - E 


A 


1 


N 


Y 


B 


2 


B 


B 


E 


II K 


IT 


Q 


I 


W 


Z 


R 


C 


3 


D 


4 


F 


6 


C 


C 


F 


I L 


0 


R 


u 


X 


1 


L 


H 


8 


I 


9 


J 


0 


D 


2 


3 


4 5 


6 


7 


8 


9 


* 


I 


K 


L 


0 


P 


Q 


s 












- -- 








’ IT 


T 


U 


V 


W 


X 


z 








Figure 21 . 









Figure 20. 



g. It is to be noted that in alphabets of the foregoing types, the 
row indicators maybe distinct from the column indicators (e.g.. Fig. 18), 
or they may not (e.g., Fig . 19 ) 5 of course , when there is any duplication 
between the row and column indicators, it is necessary to agree beforehand 
^hporTwhich indicator will be given as' the first half of the equivalent 
for a letter, in order to avoid ambiguity. (In all of the systems de- 
scribed in this and subsequent sections of this text, the row indicator 
will always form' the first 'part of an equivalent). When letters are used 
as row and column indicators they may form a key word (e.g.. Fig. 20), or 
they may not (e.g.. Fig. 21 ) ; the key words, if formed, maybe identical 
(e.g.. Fig. l6) or different (e.g., Fig. 20;. Furthermore, the plaintext 
letters maybe arranged within the matrix as a mixed sequence (e.g.. Fig. 
20), either systematically- or "random-mixed; and the matrix may contain, 
in addition to the letters of the alphabet, punctuation symbols (Fig. 17 ), 
numbers (Figs. 20, 21), etc., permitting their encipherment as such, in- 
stead of having to be spelled out. 



q j.. , 

• J A pair of digits is called a dinome ; similarly, a trinome is a set 
of three digits; a tetranome , a set of four digits; etc. Although a 
single digit would properly be termed a mononome, for the sake of euphony 
it is shortened into the term monorne. 
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h. When letters are used as row and column indicators, they may be 
selected so as to result in producing cipher 'text that resembles arti- 
ficial words ; that is, words composed of alternate vowels and consonants. 
For example , if in Figure l6 the row Indicators consisted of the vowels 
A E I 0 U in this sequence from the top down, and the column indicators 
consisted of the consonants B C D F G in this sequence ’from left to right 
the word BAUDS would he enciphered as OCABE FAFOD, which very'closel/ 
resembles code of the type formerly called artificial code language. 

Such a system may he called a false , or pseudo-code system. 



i.. As a weak type of subterfuge, biliteral ciphers may involve a 
third"" character appended to the ; basic two -character cipher unit; this is 
done to " camouflage" the billteral nature of the cipher text, This third 
character may he produced through the use of a cipher matrix of the type 
illustrated in Fig. 22 (wherein Ap=6ll, Bp =6l2, etc.); or the third char- 
acter may be a "sum-checking" digit which is the non-carrying sum (i.e., 
the sum modulo 10)5 of the preceding two digits, such as in the trinomes 
257, 831, and 662; or it may merely be a randomly -selected character (in- 
serted solely for the purpose of leading the cryptanalyst astray) . 
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Figure 22. - ■ -y / 

" * .‘hr wot 

t-^. Another possibility that lends itself to certain multiliteral 

ciphers is the use of a word- spacer or word: separator . This word sepa- 
rator might be represented by a value in the matrix; i.e,,. the separator 
is enciphered (for instance, the dinojne "39" -in Fig. 19, might stand for 
a word separator). The word separator might instead he a, single element 
not otherwise used^in the cryptosystem; i.e., unenciphered , and thus pot 
giving rise to any possible ambiguity. Thus, in Fig. 19 the digit 0 and 
in Fig. 21 the letter J might be used as word separators, since jxo 
fusion would arif e in decrypting. . - , , iJV , 



Prior to 1934, international telegraph regulations required code 
words of five letters to contain at least one vowel and code woYdg Of ten 
letters to contain at least three vowels. The International Telegraph 
Conference held in Madrid in 1932 amended these regulations to permit the 
use of 5 -letter code groups containing any combination of letters.- These 
unrestricted code groups were authorized for use after J. January 1934. 

J The tern modulo (abbreviated mod ) pertains to a cyclic s.cale or 
basis of arithmetic; thus, in the modulus of 7, the numbers. 8 and 15 are 
equivalent to 1, and 9 and l6 are equivalent to 2, etc.; or expressed 
differently, 8 mod 7 is 1, 9 mod 7 is 2. In cryptology, many operations 
are expressed mod 10 and mod 26. 
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lc. The hiliieral alphabets yielded hy the matrices of Figs. 16-21 
may also he termed bipartit e, because the cipher units of these alphabets 
may be divided into two separate parts whose functions are clearly de- 
fined, viz., vow indicators and column indicators. As will be dismissed 
later, this bipartite nature of most blliteral alphabets produced from 
cipher matrices constitute one of the weaknesses of these alphabets which 
make them recognizable a,s such to a cryptanalyst. However, it is possible 
to employ a cipher matrix in a manner which will produce a billteral 
alphabet not bipartite in character. For example, using the matrix of 
Fig. 23 one could produce the following biliteral cipher alphabet in 
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l 



which the equivalent' for any letter 'in the matrix is the SVUF'pf ijhg 1?w@ 
coordinates which indicate its cell in the matrix? 



Plain ABCDEF''a^X^ii,|i 

Cipher .14 20 19' 12 22 23 24 10 lfi 10 25 if 26 



Plain— 

Clpher- 



NO^'PQRSTUVWXYZ 
28 29 30 31 ‘13 3£ 3^ 1 6 35 36 37 11 38 



The cipher units of this alphabet are, of course, biliteral j but they are 
not bipartite. Note the equivalent of Ap, that is l4 — if divided, it 
yields' the digits 1 and 4 which have no meaning per se ; plaintext letters 
whose cipher equivalents begin with 1 may be found in two different rows 
. of the matrix, and those whose equivalents end in 4 appear in three dif- 
ferent columns . 



- 53* The Baconian and Trithemlan ciphers .— a. An interesting example 
in which the cipher equivalents are five -letter groups and yet "the re- 
sulting cipher is strictly monoalphabetic in character is found in the 
cipher system invented by Sir Francis Bacon (1561-1626) over 300 years 
ago. Despite its antiquity the system possesses certain features of 
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merit which are well worth noting .6 Bacon proposes the following 24- 
element cipher alphabet, composed of permutations of two elements taken 

five at a time: 7 

♦ 



A=aaaaa 

Bzaaaab 

Czaaaba 

Dzaaabb 

Ezaabaa 

Fzaabab 

Graabba 

Hzaabbb 



I-Jzabaaa 

Krabaab 

Lzababa 

Mzababb 

Nzabbaa 

Orabbab 

Pzabbba 

Qzabbbb 



Rzbaaaa 
Sebaaab 
Tzbaaba 
U-V zbaabb 
.Wrbabaa 
Xrbabab 
- Xzbabba 
Zzbabbb 



If this were all there were to Bacon’s invention it would be hardly worth 
bringing to attention. But what he pointed out, with great clarity and 
simple examples , was how such an alphabet might be used to convey a 
secret message by enfolding it in an innocent, external message which 
might easily evade the strictest kind of censorship. As a very crude 
exanjple, suppose that a message is written in capital and lower-case let- 
ters, any capital letter standing for an "a” element of the cipher alpha- 
bets, and any small letter, for a "b" element. Then the external sentence 
"All is well with me today" can be made to contain the secret message 
"Help." Thus: 



A L 1 is 
a a b b b 

*> y — 

H 



WEIL W I t H mE 

aaba a, .aba b a. 

v ~ v ' K y — 

E ’ L 



Today 

a b b b a. 

* — r~~ * 

P 



Instead of employing a device so obvious as capital and small letters, 
suppose that an "a" element be indicated by a very slight shading, or a 



For a true picture of this cipher, the explanation of which is often 
distorted "beyond recognition even by cryptographers, see Bacon's own des- 
cription of it as contained in his De Augment is Scientiorum (The Advance - 
ment of Learning ), as translated by any first class editor, such as Gilbert 
Watts (l640) or Ellis, Spedding, and Heath (1857, 1870) . The student is 
cautioned, however, not to accept as true any alleged "decipherments" ‘ ob- 
tained by the application of Bacon's cipher to literary works of the l6th 
century. These readings are purely subjective. 

7 Bacon's alphabet was called by him a "biliter ai alphabet" because it 
employs permutations of two letters. But from the cryptanalytic stand- 
point the significant point is that each plaintext letter is represented 
by a 5-character equivalent. Hence, present terminology requires that 
this alphabet be referred to as a ouinque literal alphabet . Although the 
quinqueliteral alphabet affords 32 permutations, Bacon used only 24 of 
them, because in the l6th century the letters I and J, U and V were used 
interchangeably. Note the regularity of construction qf Bacon's biliteral 
alphabet, a feature vhich easily permits its reconstruction from memory. 
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very slightly heavier stroke. Then a secret message might easily he thus 
enfolded within an external message of exactly opposite meaning. The 
number of possible variations of this basic scheme is very high. The 
fact that the characters of the cryptographic text are hidden in some 
manner or other has, however, no effect upon the strict monoalphabetic lty 
■of the scheme. . , . 

. - 2 L _s v , , J 1 

b. Almost 100 yearB before Bacon’s time, the abbot Trithemius, 
born Johann von Heyderiberg ( 1462-151 6), invented a triliteral alphabet 
which he evidently intended to use in a fashion similar to Bacon’s 
alphabet | i.e., as a means of disguise or cover for t a secret text.^ This 
alphabet, modified to Include the 2 6 letters of the present-day English 
alphabet, is shown in Fig. 23 belowj it consists of all the permutations 
of three things taken three at a time, i.e., 3^ or 27 in all. 



Aslll 


Dr 121 


' Grl31 


J=211 


M=221 


P=231 


S=3li 


V=321 


Y=331 


B-112 


Erl22 


Hrl32 


K=212 


W:222 


Q:232- 


T:312 


W:322 


Z=332 


C=113 


F:123 


1=133 


L=213 


0:223 


R=233 


U:3'i3 


X:323 


*-333 








' -~s. ‘ 


■* — **. _j 




* 1.4- 


k > « ? 








• — 




Figure 23. 
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. 1 ft 



The cipher text of course does not have to be restricted to digits ; any 
' groupings of three things taken three at a time will do. " ■- * 



54, Analysis of multiiiteral’, "monoalphabetic substitution ciphers . — 

’ a". Biliteral ciphers and those of the other multiiiteral (trillteral , 
quadriliteral, . . .) types are often readily detected externally by the 
fact that "the cryptographic text, is usually Composed of but a very liml- 
- ted number of different characters . ■* They are handled in exactly the same 
’manner' as 'are uniliteral, monoalphabetic substitution ciphers. So' long 
'as the same character, or combination of characters, is always used to 
represent the’ same plaintext lebter, and so long as a given letter of the 
-plain text is always represented by the same character or combination of 
characters , 'the 'Substitution is ‘ strictly monoalphabetic and can be handled 
in the' s iirrple . manner described’ in the preceding section of this text. 

" = * t - - a _ - - -u _ - . . .'j~„ 

b. In the case of biliteral ciphers in which the row and column in- 

dicators are not identical, and the direction of reading the cipher pairB 
is chosen at will for each succeeding cipher pair, an analysis of the 
contacts of the letters comprising the cipher 'pairs will disclose that 
there are tiro distinct families of letters, and a cipher pair will never 
consist of two letters of the same family. With this fact discovered, 
the cipher may be quickly reduced to uniliteral terns and solved in the 
manner previously mentioned. _ .’ ' " -V" . 

c. If a multiliteral cipher includes provision for the encipherment 
of a word separator, the cipher equivalent of this word separator may be 
readily identified because it will, have the highest frequency of any ci- 
pher unit*. ’ On ’the "other hand, if the word separator is a single char- 
acter (see subpar, 52 j.. on the use of the digit 0 and the letter <j), this 
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character may he identified throughout the encrypted text hy its position- 
al appearance spaced "wordlength-wise" in the cipher text, and by the 
fact that it never contacts itself. If this single character is used as 
a null indiscriminately throughout the cipher text, instead of as a word 
separator, the analysis is a bit more complicated but not as great as 
might be thought, , - - - 

d. As a general rule, it is advisable to reduce multiliteral cipher 
text to uniliteral equivalents,- especially if a triliteral frequency dis- 
tribution is to be^made. If not more than 36 different combinations are 
present in a cryptogram, the extra values over 26 maybe represented by 
digits for the purpose of this reduction. If, however, more than 36 
different combinations are found in the encrypted text, it is usually not 
worth the trouble to attempt any uniliteral reduction, and the cipher' 
text can be attacked in its multiliteral groupings . 

e. As one of the first steps in the solution of any multiliteral 
cipher in letters which appears to involve the use of a cipher matrix, 
it is generally advisable to anagram the letters comprising the row. and 
column Indicators in an attempt to disclose any key words for these in- 
dicators . When the anagramming process does disclose such a key word or , 
words, the next step is to make a skeleton reconstruction matrix which Is 
a duplicate of the original enciphering matrix in that the indicators are 
arranged in the same order as on the original. Then, as plain text is 
recovered in the cryptogram hy any of the methods outlined in the. previous 
section of this text, the recovered plaintext ietters should be inserted 
in the proper cells of the reconstruction matrix, so that any systematic 
arrangement of the plaintext letters, if present in the original, may be 
disclosed prior to recovery of the complete plain text.. Furthermore, it 
may in some instances he found worthwhile, immediately after successfully 
uncovering the key words used as indicators, to make a frequency distri- 
bution of the particular cryptogram in the form of tally marks within the 

- properly arranged frame of the reconstruction matrix, -because it may be 
that a few moments 1 study of the locations of the crests and troughs in 
the distribution made in that form may, if the plaintext letters _are ar- 
ranged in the normal sequence or in a keyword~mlxed_ sequence (especially 
if it Is related to the key words for the indicatory), provide a basis 
for the derivation of this sequence at one stroke, without recourse to_ 
analysis of bh. cipher text. ' . ' ‘ ' J r ' \ 

„ « ^ t * * T 

jjg. Historically interesting exampl es, --a. Two examples of multi- 
literal ciphers of historical interest, will be cited as illustrations . 
During the campaign for the presidential election 'oT 1876 (Hayes vs. 
Tilden) many cipher messages were exchanged between the Tilden managers 
and their agents in several states where the voting was hotly contested. 
Two years later the Hew York Tribune^ exposed many irregularities in the 



r " d “ ’ “ 1 1 “ 1 - 

New York Tribune, Extra No. hh , The Cipher Dispatches , New York, 

1879 . 
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campaign by publishing the decipherments of many of" these messages." 
These decipherments were achieved by two investigators employed by the _ 
Tribune, and the plain text of the” messages seems to show that‘s illegal * 
attempts and measures to carry the election for Tilden were made~by his 
managers. Here is one of the messages: 



RANEY, Tallahassee. 



JACKSONVILLE, Nov. 16 (1876). 



Ppyyemnsnyyypim a~s hnsyy'sp it epaaehshns 
pensshnsmmpiyysnppyeaapieissyeshainsssp 
eeiyyshnynsssyepiaanyitnsshyyspyypinsyy 
ssitemeipimmeisseiyye i ss i t e i epyyp e e i aas s 
imaayespnsyyianssseissmmppnsp ins snp i ns i m 
imyyitemyysspeyymmnsyys s i t spyype a p p p m a 
aa'yypiit 

L'Engle goeq up tomorrow." "■ T » • -r- -- ... 

" ’ * ■ DANIEL. 



iepyypeeiaass 
spinssnpins in 
tspyypeapppma 



Examination of the message discloses that only ten different letters are 
used. It is probable, therefore, that what one has here is .a cipher 
which employB a multiliteral alphabet. First assuming that ihe alphabet 
is one in which combinations of two letters represent "single "letters of 
the plain text, the message is rewritten in pairs and substitution Of 
arbitrary letters for the pairs is made, as seen below: 



1 - >" Jacksonville, Nov. lb. 

- '• ' - - : _ . >,f i£; 

GEO. F. RANEY, Tallahassee: ‘ -- 

Have Marble and Coyle telegraph for influential men "’from Delaware -and 
Virginia. Indications of weakening here. Press advantage and watch 
Board. "L’Engle goes up tomorrow. . “ - ■ ■ ^ 



DANIEL. 



b. The other example, using numbers, is as follows: 

* ‘ : “ Jacksonville, Nov, 17 , 

S. PASCO and E. M. L’ ENGLE: 



84 55 84 25 93 34 82 31 31 75 93 82 77 33 55 42 

93 20 93 66 77 66 33 84 66 31 31 93 20 82 33 66 

52 48 44 55 42 82 48 89 42 93 31 82 66 75 31 93 



r 



A triliteral frequency distribution is then made and analysis of the mes- 
sage along the lines illustrated in the preceding section of this text ; 
yields solution, as follows : ~ ’’ -- - 

■ ' - _ _ - - - - -- . ~ . 

1 ’ 1 Jacksonville, Nov. 16. 



DANIEL. 



r - 
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There were, of course, several messages of like nature, and examination 
disclosed that only 2 6 different numbers in all were used. Solution of 
these ciphers followed very easily, the decipherment of the^one given” 
above being as follows: " l " 'y*""' •' • ^ - - . 

» 1 1 • ? 1 -i ‘ r 

Jacksonville, Nov. 17*. 

S. PASCO and E. M, L'ENGLE: 

Cocke will be ignored, Eagan called in. Authority reliable. ' ’ /'•’ 

' * . ' • ■- . .DANIEL. 

- - -- r • - . - c . - . . * . 

c. The Tribune experts gave the following alphabets as the result 
of their decipherments: - _ _ 



AA-0 


EN»Y 


IT=D 


j • 

NSrE 


T 

PP=H 


SS=N 


AI S U 


EP=C 


MA=B 


NYrM 


SH=L 


YE=F 


EI“I 


IA=K 


MM=G 


PE=T 


SN=P 


YIzX 


EM=V 


B1=S 


NN=J 


PI=R 


SF=W 


YY=A 


20 =D 


33=N 


44 =H 


62=X 


77=G 


89=Y 


25=K 


34=^ 


48 =T j 


66 z A 


82=1 


93 =E 


27 =S 


39=P 


52 =U ‘ 


68=F ' 


84 =C 


96 =M 


3l=L 


42 =R 


‘55=0 


75 =B 

» j* 


87=V 


99=J 



They did not attempt to correlate these alphabets, or at least pjiey say 
nothing about a possible relationship . Tfte present author has, however, 
reconstructed the rectangle upon which these alphabets are based, and it 
is given below (Fig. 24)’. 

■ ‘ Letter or Number ' - - 



t i v « 



HI SPAYMENT 
12 34567890 



r it. 


H 1 
12 


* 


S 3 


<D 


1 

P 4 

| 


& 

N 


A 5 


O 

U 

& 


Y 6 


-+=> 


M 7 


*3 

m 

r-M 


E 8 




N 9 
T 0 

















— 






1 








K 




S 






D 


L 




N 


W 










P 






R 




H 








T 








U 


• 




0 














X 








A 




F 














B 




G 










I 




C 






V 




Y 








E 






M 






J 

























Figure 24. 
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It is amusing to note that the conspirators selected as their key a 
phrase quite in keeping with their attempted illegalities - HIS PAYMENT - 
for bribery seems to have played a considerable part in that campaign. 

The blank cells in the matrix probably contained proper names, numbers, 

etc. 9 _ _ - 

■w w ■» >*V t 

56. The international (Baudot) teleprinter code . — -a. Modern print- 
ing telegraph systems ,1^ or teleprinter systems as they - " are more often 
called, make use of a five -unit'' code- 1 - 1 or alphabet which is similar to 
the Baconian alphabet treated in par. 53* Like the Baconian alphabet, 
the teleprinter alphabet Is composed of permutations of two elements 
taken five at a time, making it possible to obtain 32 different permu- 
tations, 26 of which are assigned to the letters of the alphabet, leaving 
1 for an ’’idle condition 1 ' and 5 for certain printer operations called 
functions , such as "space", "figure shift", "letter shift," etc, 

b. During electrical transmission, the two distinct elements of 
which each character is composed take the form of (l) a timed interval of 
electrical current and (2) a timed Interval of no current, which are 
commonly referred to as "mark" impulses and "space" impulses, respective- 
ly. In certain operations, a paper tape is prepared of the traffic to be 
transmitted, or a paper tape may be prepared of the incoming traffic at 
- the receiving end; in such tapes, the elements of the Baudot characters 
take the form of punched holes ("mark" impulses) and imperforate positions 
("space" impulses). . ^ ^ A , - j 



o 

As was mentioned in a previous footnote, a matrix containing such 
items would be termed a syllabary square; for example of such matrices 
- see the treatment of syllabary squares and code charts in Section X. 

10 Such systems are characterized by the transmission and reception - 
printing of messages by electrical means , incorporating two electrically- 
connected instruments , resembling typewriters . When a key of the keyboard 
on the transmitting instrument is depressed, an electrical signal is 
transmitted to the receiving instrument, causing the corresponding char- 
acter to be printed therein. Usually the message is printed at the local 
as well as the distant station. The system has been adapted to radio as 
well as wire and overseas cable transmission*. 

The five-unit code was first applied to teleprinter systems by Jean 
Maurice Emile Baudot > and is commonly known as the Baudot 

code. It Is worthwhile to point out that Baudot apparently constructed 
his alphabet to correspond with normal frequencies of characters (with 
certain exceptions), since the most frequent ones are represented by per- 
mutations requiring the least electrical energy on the basis of "marking" 
and "spacing." In this respect Baudot "took a leaf out of Morse’s note- 
book." 



117 



RECTI IUCTEft 







REF ID:A56892 



RE3TRICTEB - 

£« The teleprinter code in international use Is given in Chart 
helov 7 wherein the mark and space impulses (known collectively as" hands ) 
are illustrated as the holes (shown as "black dots) and ‘'no-holes" of a 
teleprinter tape. The letter equivalents ("lower case") are self-explan- 
atory, The figure shift is used to change the meaning of a particular 
character to an "upper case" equivalent, and when it is desired to return 
to lower case, the letter shift is used; in regular teleprinter usage. 



UPPER 

CASE 


WEATHER SYMBOLS 


t 


© 


0 


s 


3 




£ 


i 


m 






1 




i 


□ 


□ 


a 




□ 


5 

V 


7 

7 


8 


z 


/ 


l 6 _ 

6 


;■ 

2 


£ 


R 1 

< 


E 


Id 

O 

a 


t 

& 

s 


t 

s 

c 

9 


COMMUNICATIONS 


- 


U 




T 


nr 


T 


a 


jC 

H 


1 j 




i 


> 




H 


IV 


0 


1 

■ - 
0 


4 _ 

R 


4 


2 


/ 


LOWER CASE 


A 


i 


c 


0 


E 


fl 


G 


1 


j 


X 


L 


M 


fl 


0 


p 


s 


T 


U 


V 


w 


X 


Y 


0 : 
0 


u. 

J 




1 


0 


• 




• 


e 


□ 




- 


• 


• 

• 


0 

e 

0 












• 


0 


9 




9 




9 

9 

0 


9 


9 


9 






■ 


1 


9 


i 


0 




O 1 




■ 


O 


• 

0 


•_ 

0" 


0 






9 


# 




0 


9 

0 






0 


0 


0 


8 


| 


• 

O 


0 

0 


FEED HOLES 


0 


0 


O 


0 


i 


0 


0 


0 


Q 


0 


0 


0 


o H 


0 


0 


0 


O 

O 


O 


3 






S 




i 


e 




¥ 


• 




• 




0 


9 




0 


9 




9 




• 


0 

9 




9 










i 


9 

© 


0 


4 




0 


9 


e 




Q 


9 






0 


□ 


a 


0 

0 


0 


9 

» 




■ 


9 






9 


0 


9 




¥ 


8 


5 


■ 


□ 


■ 


■ 


■ 


■ 


□ 


□ 








I 9 | 


□ 




■ 


□ 




0 


9 


• 


■ 


9 


□ 


^ 1 


? - *7 











Chart 7 . Internationa 7 teleprinter code, 



t 

t 



the "communications" set of qpjper-cas.e equivalents are the ones recorded 
on the typed copy "by the teleprinter,' whereas the "weather symbols" are 
the upper-case equivalents which are printed in teleprinter systems de- 
signed for the sending and receiving of weather information. The spacq 
is used to separate words; the carriage return (c,R.) effects the retqpff 
of the teleprinter carriage*!;’© the right and the line feed (L.F.) roll? 
the platen to the next _ 3 ine for printing (cf* the corresponding functions 
of an ordinary typewriter) , In addition, when the upper-case equivalent 
of "S" is used, a "bell rings in the receiving teleprinter as a signal to 
call the operator to his machine, or to indicate that traffic is about to 
he sent. ’ — ' * 

- * j - . -»r ■> . i 

d. In Fig. 25 is shown a portion of a teleprinter tape containing 
the beginning of the phrase "How is the time for all good men . . 




• • 



* ••• • 



• • 



• • •• • 
• • © 



• •• • 

•••• » • 

•• • 



NOW IS THE TIME FOR ALL GOOD MEN 



Figure 25. 

The small holes, one of which appears in every position of the tape be- 
tween the second and third levels, are sprocket holes used for advancing 
the tape through the transmitter unit. 
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e. It Is to be emphasized that messages are not made secure from 
unauthorized reading merely by sending them by means of an ordinary tele- 
printer system — the teleprinter alphabet is internationally known, just 
as the English, Russian, etc. alphabets are. In order to provide secur- 
ity for a teleprinter message, it is just as necessary to apply thereto 
some sort of cryptographic treatment as it is to any other kind of mes- 
sage. The cryptosystems used for teleprinter encryption may involve 
either, or both, of the two classes of ci yptographJ c treatment, viz ., 
substitution and transposition. A substitution treatment might involve 
_ changing certain of the mark impulses of the characters comprising a mes- 
sage to space impulses, and vice versa, according to a prearranged sys- 
tem, a transposition treatment might involve changing the order of the 5 
impulses in the Baudot equivalents for the characters comprising a mes- 
sage j and so on. The cryptographic treatment can be accomplished by a 
special cipher attachment (called an "applique unit") to a teleprinter; 
thus ho modification of the teleprinter itself would be necessary. There 
are, of course, self-contained cipher teleprinters designed as such for 
engineering or cryptographic reasons, or both. 



f. In the analysis of encrypted teleprinter systems, recourse is 
had to special tables 12 of the frequencies of single Baudot characters, 
digraphs, trigraphs, etc., as they appear m teleprinter traffic. It is 



ic, as in any other type of 
,or, this character has the 
Furthermore, one of the 



important to note that in teleprinter traf. 
traffic involving the use of a word separa 
highest frequency of any plaintext element 

highest-frequency plaintext digraphs, in aldition to those wherein the 
word separator constitutes one of the elements , will be the com! ination 
"carriage -return/line -feed", since this combination of characters is used 
in the normal procedure of typing each line of text on the teleprinter. 



12 In such tables, as is common in cryptanalytlc practice, the mark im- 
pulses are designated hy a plus symbol ( + ), and the space impulses are 
designated by a minus symbol ( - ) . In addition, it is usual in such 
tables to denote the character representing the figure shift by the digit 
"2", the space by "3", the letter shift by 'V, the line feed by "5", the 
blank by "6" , and the carriage return by "7" . 
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- - * SECTION VIII 

- _ u - * * . 

MQLTILITE11AL SUBSTITUTION NITS VARIANT'S 

’ T. * 

- \ - ~~ Paragraph 
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More complicated types of qipher alphabets with variants 59 

Analysis of simple examples.... 60 

Analysis of more complicated examples 6l 

Analysis involving the use of isologs 62 

Further remarks on variant systems 63 



57. Purpose of providing variants In monoalphabetic substitution . -- 

a It has been seen that the individual letters composing ordinary 
intelligible, plain text are used with varying frequencies, some, such as 
(m English) E, T, R, I, and N, are used much more often than others, 
such as J, K, Q, X, and Z. In fact, each letter has a characteristic 
frequency which affords definite clues in the solution of simple mono- 
alphabetic ciphers, such as those discussed in the preceding sections of 
this text. In addition, the associations which individual letters form 
in combining to make up words, and the peculiarities which certain of 
them manifest in plain text, afford further direct clues by means of 
which ordinary monoalphabetic substitution encipherments of such plain 
text may be more or less speedily solved. This has led cryptographers to 
devise methods for disguising, suppressing, or eliminating the foregoing 
characteristics manifested in cryptograms produced by the simpler methods 
of monoalphabetic substitution One" category of such methods, the one to 
be discussed in this section, is that in which the letters of the plain 
component of a cipher alphabet are assigned two or more cipher equivalents, 
which are called variant values (or, more simply, variants ) . 

b. Basically, systems involving variants are multiliteral 1 and, in 
such systems, because of the large number of equivalents made available 



Uniliteral substitution with variants is also possible. Note the 
following cipher alphabet, illustrated by Captain Roger Baudoum m his 
excellent treatise, Elements de Cryptographie , p. 101 (Paris, 1939) 



Plain ABCDEFGHILMNOPQRSTUVXZ 
Cipher LGORFOAHCMBT IDNPUSYEWJ 
K X Z 

V 



Baudoum proposed that J p and Y p be replaced by Ip, K_ by C or Q p , 
W r, by W_ — thus four cipher letters would be available as Variants 



ll 



"by 

e high-frequency plaintext letters in French. 
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for 
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by the combinations and pexmutations of a limited number of elements, each 
letter of tlie plain text may "be represented by several multiliteral cipher 
equivalents which may be selected at random. For example, if 3-letter 
combinations are employed as the multiliteral .equivalents, there are 
available 2 6^ or 17*576 such equivalents for the 2 6 letters of the plain 
text, they may be assigned in equal numbers of different equivalents for 
the 2 6 letters, in which case each letter would be representable by 676 
different 3-letter equivalents, or they may be assigned on some other 
basis, for example, proportionately ijp the lelative frequencies of plain- 
text letters . For this reason th^s type of system may be more completely 
described as a monoalphabetic, multiliteral [substitution with a multiple - 
equivalent ciphci alphabet .^ Some author's term such a system ^simple 
substitution with multiple equivalents", others term it "monoalphabetic 
substitution with variants", or mult llit era 1 substitution with variants . 
For sake of brevity and precise terminology^ the latter designation will 
be employed in this text, Jt being understood without further zestatement 
that only such systems as are monoalphabetic will be discussed. 



c. The primary object of monoalphabetic substitution with variants 
is, as has been mentioned above, to provide several values which may be 
employed at random in a simple substitution of cipher equivalents for the 
plaintext 2 etters 

d. A word or two concerning the underlying theory of (monoalphabetic) 
multiliteral substitution with variants may not be amiss . Whereas in 
simple or single -equivalent substitution it has been seen that 

\ “ 

(1) the same letter of the plain text is invariably represented by 
but one and always the same character of the cryptogram, and 

(2) the same chai acter of the cryptogram invariably represents one 
and always the same letter of the plain text, 

- - - t 

in multiliteral substitution with variants it will be seen that 



(1) the same letter of the plain text may be represented by one 
or more different characters of the cryptogram, but 

(2) the sane charactei of the cryptogram nevertheless invariably 
represents one and always the same letter of the plain text. 



58. Gimp It. I ties of cipher alphabets with variants . — a. The mat- 
rices shown on the next page provide some of the simpler means for 
accomplishing monoalphabetic substitution with variants. The systems 
incorporating these motijice? aie extensions of the basic ideas of multi- 
litcial substitution treated in par 52 The variant equivalents for 
any plaintext letter may be chosen at vill, thus, m Pig. 26, Ep=10, 15, 
60, 01 65, m Fig. 27, Fp=AU c , tZ c , FU C , EZ C , LU C , 01 LZ C , etc 



Ci . the title of the picceding section, "f ult ill 1 eral substitution 
with s ingle -equival ent cipher alphabets M 

■ ■ ■ < • f** — !* V** 
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Figure 2 6 
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Figure 29 



Figure 30 



Figure 31 
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Figure 33 
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b. It is to be noted that encipherment by means oC the matrices in 
FIguios 27# 28, and 33 is commutative , i.e,, ho coordinates may he road 
in either row-colpipn or column -rc^r order witluut crypt ographic onibiguj by, 
since ilxere is no "duplication hetxrecn the ro*f cypd column coox dins Lea , 

The rcmaininG matricos above are ;npn-commutp.tive # therefore a convention 
must he a£ieed upon as to the older of reading the coordinates . It should 
also he noted that m Figs. 30 and 31 the letters in the square have been 
inscribed in such a manner that# coupled" with the particular arrangement 
of the row and column coordinates, the number of variants available for 
each plaintext letter is ^roughly proportional io the frequencies of the 
letters in plain text A sunilir idea ib found in Fjg. 35# vheiem the 
top row of the rectangle contains ja woid composed of bigh-lrequency let- 
ters, and the coordi nates are arranged in a manner roughly corresponding 
to the frequencies of plaintext letters. The matrix In Fig. 28 is a mod- 
ification of the pseudo-code system described m par. 52h, with the added 
feature of variants . ^ ^ ~ , 

1 c. Other simple ideas’ for producing variant systems are matrices 
such as the following ' 1 1 



II I-J K 



U V 



08 09 10 11 *32 13 l4 15 16 -17 38 10 20 21 22 23 2 b 25 01 02 03 04 05 06 07 

35 36 37 38 39 to hi 42 >13 44 I15 *i<5 I17 k 8 49 50 2 6 27 28 29 30 31 32 33 

68 69 70 71 72 73 7*i 75 51 52 53 5 l i 55 56 57 58 50 60 61 62 63 65 66 67 

87 88 89 90 91 92 93 9b 95 96 97 98 99 00 76 77 78 79 80 81 82 83 84 85 86 



Figuie 36 



D E F G 



S~> r- 

Q R 



>< 1 

T U 



«♦ 

U X 




ABODE FGH.JJKLM II OPORSTUVUXYZ 



l4 15 16 17 lO 19 20 21 22. 23 24 25 26 01 02 03 04 05 06 07 08 09 10 11 12 13 
27 28 29 30 31 32 33 3 b 35 t 36 37 38 39 ! i0 4i 42 43 44 45 46 47 *i8 49 50 51 52 
58 59 60 61 62 63 Ck 65 66 67 68 69 70 71 72 73 7b 75 76 77 78 53 5**- 55 56 57 
81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 9 f 98 99 00 & 52 W////////A 79 80 



\ 

m , Figure 37 i\ 

In those two m tuces there has been s regular inscription oi the dinomes 
m the 1 own • 1 1 thermore, m F1 g 36 the dinomes 01, 26, 51, and 76 (1 o., 

the lowest number in each of the foul sequences) give the Fey void (TRIP) 
foi that matrix, and ip Fig 37# the dinomes 01, 27# 53# end 79 denote 
the ley woid (NAVY) for that matrix The secur ivy of systems involving 
such matrices would of course be greatly iiipxoved 1 the dmores nere 
assigned m a random mannei but tlicn fbe easy mnemonic iesture of the 
foul sequences and the lev woid would be lost. 
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a. n.n Interesting adaptation in_a, disc form of bho typo of matrix 
illustrated in Fl£. 37 is the following device reputedly once used by 



the Mexican Ainiy 




V * 

"i i 









I - XX > t 1 

1L- 

The device consisted of five -concentric discs, the outer disc bearing the 
2 6 letters of the alphabet, and the other four bearing the sequences 
01-26, 27-52, 53-78, and 79-00* The rotatable discs made it possible to 
change the keys at frequent intervals, without the necessity of writing 
out a new matrix each time 



59 More complicated types of cipher alphabets with variants .— 

'”a. Matrices such as those m Figs . 38 , 39, and 46 below are termed 
frequential matrices , si ice the numbei" of cipher values available for any 
given plaintext letter closely approximates its relative plaintext fre- 
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Figure 39 
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figure 40 



b . In the fragmentary matrix illustrated in Fig. 38# the number of 

occurrences of a particular letter within the matrix is proportional to 
its frequency in plain text, the letters are inscribed in a random manner, 
in order to enhance further the security of the system. In Fig. 3 9j we 
have a modification of the idea set forth in Fig. 38, except that the 
size of the matrix has been reduced from 2.6 <2.6 to 10x10, in this case, 
the letters (with appropnate number of lepetitions) have been inscribed 
in a simple diagonal route ( lower left to upper right) within the square, 
and the coordinates have been scrambled, for greater security. In Fig. 40 
there is illustrated a type of cipher square which is known in crypto- 
logic literature as the Grandpre cipher , m this square there are in- 
scribed ten 10-letter words containing all the letters of the alphabet in 
their approximate plaintext frequencies These ten words are further 
linked together by a 10-letter word which appears vertically in the first 
column, as a mnemonic feature for the inscription of the words in the 
rows . • 1 «* r > it t ti i F 1 f. 

c. The frequential-type system represented in Fig. 4la (enciphering 
matrix) and 4lh (deciphering matrix) was described by Sacco?, who pro- 
posed that the dinomes inscribed in the enciphering matrix be thoroughly 
disarranged by applying a 'double transposition to the dinomes 00-99 as a 
means of suppressing oiiy patent relationships among the variant values 
for the variouc plainte* ct letters, furthermore, 'the nulls incorporated in 
the matrix were to be used "occasionally during the encryption of a mes- 
sage, in order to throw a cryptanalyst off the irack In this example the 
number of variant vplues for each plamte 't letter hos been established, 
of course,’ from the standpoint of Italian letter frequencies. 

Nr -f- m. ,1 j 4 j 

. 1 <■ ' i - , « • * 1 

Q **-*-„- . 

Sacco, Gencrale Luigi, IfonuaJe di Cnttografis , 3d Ed., Rome, , 

p. 22. “ ~ " 

I F 1 ' 
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d. The Baconian cipher described in par. 53a may he used as a basis 
for superimposing, additional complexities For instance, the "a" elements 
may he represented by any one of the 20 consonants as variants, while 
the "b" elements may he represented by any one of the six vowels, or the 
letters A-M may he used to lepresent bhe_ "a" elements and the letters 
N-Z for the "b" elements, digits may be used for the "a” and "b" elements, 
either on the basis of the first five and last five digits, or on the 
basis of the odd and even digits, or the first 10 consonants (B-M) and 
the last 10 consonants" (W-Z) may be used for the “a" and "b" elements, 
with the vowels used occasionally as nulls --thus the resultant crypto- _ 
grams will resemble those of a fairly complex cryptosystem However, 1 ' 
once the cryptanalyst assumes the possibility of such a system, its com- 
plexity is more appoient than leal Similarly, variations of this genre 
maybe superimposed on triliteral systems such as the Trithemian' cipher 
illustrated In par. 53b; valiants for the "1", "2", and " 3 " elements may 
be chosen m such a way as to provide a large number of equivalents for 
each basic triliteral combination 



£. Another scheme for e complex variant system is a summing -tr inome 
system . In this cryptosystem, each plaintext letter is assigned a unique 
value of 1 to 2 6 , this value 13 then expi essed as a tunome, the digits 
of which sum to the designated value of the letter . For example, if a 
letter "has been* assigned the value ’'4", it may be represented by any one 
of the following permutations and combinations 11 



oo4 
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112 


202 


301 
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o4o 


121 


211 


310 


** 


022 


103 


130 


220 


400 





Since the values toward the middle of the iange 1-26 maybe represented 
by a very considerable number of summing -tnnomes (e.g., for the values 
13 and 14 there are 75 variants each) , such a system would offer a crypto- 
grapher wide latitude m the choice of cipher equivalents in enciphering. 



jj; -- - - - 

The representations of an integer (i.e., a whole number) as the sum 

of integers in all possible ways $re termed thg partitions of that num- 
ber. The partitions in this subparagraph are mod 10 and olso include the 
digit 0 in order to form trinome equivalents out of^ all the possible per- 
mutations. ^ l ~ 
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especially if the "basic values *of the plaintext letters were chosen to 
correspond with the scale of their relative frequencies, such as the fol- 
lowing, 7 

BWYU3PHDI ONETRlsL OPMG-VXKZ 
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 32 23 24 25 36 37 

m m g % u s s & 

1 gnUgg gg g g g%mwn m g 

\ Tr r*m:m t m~k gg g m'gg g zmgg m - , 

ggmm 

— g g g g g g & g^ g — 

. ggg g g z 

, , *g~%gg-g\% 

‘ > r ' — - ^ p ^ ^ ^ 

o/ ** * *, v fef^ g i? ^ ^ g ^ = 

«- *■»- - * - ' ■= g = 

■ ■ — r — 
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.AWl 
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/ > 



The tallies "beneath each value represent the number of variants possible 
for the particular value The unused values for 0 and 27 (uniquely re- 
presented by 000 and 999 > respectively may be used for punctuation marks, 
nulls, or other spec lal -puj^o s e symbols! Since such a system, once sus- 
pected, would offer lxtt ] 4 difficulty^ to a cryptanalyst, certain modifi- 
cations would be necessary m order Jo pose any real obstacles m the way 
of solution. For instance, if the numerical value of a letter is expres- 
sed by permutations of 3_ letters (instead of digits) out of a set of the 
10 letters A-J wherein the sequence W the letters A-J represents a dis - 
arranged sequence of the digits $-9, such a system may be among the most 
complex types of ciphers, m the realm of monoalphabetic substitution, 
requiring the solution of many simultaneous equations , A further refine- 
ment would involve the use of all 26 j.etters as variants, in predetermined 
groups, to represent the digits ^-9 Fortunately for the cryptanalyst, 
such systems are impracticable for i’Slid military use, but if they were 
encountered, a sufficiently large volume of text, coupled with Hitt’s 
four essentials quoted m Section I, would eventually make a solution 
possible. The i actual crypt analytic complexity of certain apparently 
exceedingly cqmplex cryptosystems is dep/ndent on their being correctly 
used at all times, which is not invariably the case with military ciphers. 







' The solution would involve simply dividing the cipher text into 
groups of 3 digits, summing the trinomes thus produced to yield 28 pos- 
sible basic values, and solving these "basic values as m any simple mono- 
alphabetic substitution cipher. 
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60. Analysis of simple examples , 
available Tor study 



-a. The following cryptogram is 
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The first thing that strikes the v eye is the total absence of vowels, re- 
markable not only because six Testers are missing (cf. the A test) in a 
text of this size, but also because'al] six of these letters fall into 
an identical limited category— a significant non-random phenomenon. Since 
a uniliteral substitution alphabet with six letters missing is highly 
improbable, the conclusion of multiliteral substitution is obvious. 

Upon closer inspection it is found that, if the cipher text is divided 
into pairs of letters, only ten consonants (BDGJLNQSYX) are used 
as prefixes, and the remaining ten consonants (CFHKMPRTWZ) are 
used as suffixes — thus the biliteral (and bipartite ) characteristics of 
the cipher text are disclosed. A &igraphic° distribution is therefore 
constructed ' * »*• 

- ^ -* 
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\ If it had not been noticed that the cryptogram should be divided 
into, pairs for analysis, a biliteral distribution (see par. 23d) might 
have peen made, in order to reveal contact affinities of the cipher let- 
ters. = "* ’ 
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b, It is poo £5 lble that the cr^-ptogram under study may involve Ihc 
use of a small enciphei mg matrix with. variants for the rows and columns . 
Since there is available an easily-applied special solution whach permits 
the detei miration of the roy indicators which are equivalent (i.e., inter- 
changeable variants) and the column indicators which are equivalent, mere- 
ly from a study of the digraphic distribution, this possibility ns examin- 
ed. The special solution is based on the following considerations, m a 
message of moderate length for such a cryptosystem, it may be assumed 
that the various possible' cipher pairs for a given plain-text letter will 
be used with approximately equal frequency, for this reason, the cipher 
letters which pair with one of the letters used to indicate any particular 
row of the enciphering matrix may be expected to pair equally often with 
any other cipher letter which has been.uged to indicate the same row (and, 
of course, the same is true concerning the column- indicator letter-' 'I . 

Thus, in the digraphic distribution of such a cryptogram, sets of runs 
appear which have similar "profiles" and, likewise, sets of similar col- 
umns .7 First a study will be made of the rows of the distribution t just 
compiled, in ah attempt to locate and isolate those which match with each 
other, then, the'’same -trill be done with the "columns of the distribution. 

c. It is noted that the "L" and "V" distributions have pronounced 
similarities (Fig. 42a) — these rows came under consideration first because 
of their unique "heaviness" of their frequency characteristics. Likewise, 
the "D" and "N" rows have. homologous attributes m their appearance (Fig. 
42b) . However, the further grouping of the rows by ocular inspection may 
present difficulties to the student, since b -> may not yet trust his eye 
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Figure 42a 
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Figure 42b. 




in matching distributions , and he may feel _the need for some kind of 
statistical assurance. In the following subparagraphs there is given the 
technique of a more precise method for matching, roathemat ic al m nature. 






-7. 



^ These similarities are especially pronounced when the encipherer 
uses a "check-off" procedure for choosing his variants for each letter, 
that is, when he systematically "checks off" the variants used during 
encryption to msuie that all possible variants are used_xn approximately 
equal proportions . ~ " * ' "" 
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* d. This Method of matching in an attempt to "equate” interchangeable 
variants involves computing a separate value for each trial matching Of' a 
particular row (or column) against each of a series of other rows (or 
columns, as appropriate) --such a value is taken os an indication of the 
"goodness of match" exhibited by Ihe particular trial, the theory being 
that the correct match will produce the highest value The value fob a 
particular trial match is computed by multiplying the number of tallies 
in each cell of one row (or column) by the number of tallies in each cor- 
responding cell in the other row (or column) and then totaling the pro- 
ducts thus obtained. Because of the way in which it is produced, such a 
value is termed a "cross -products sum" . - ' - - 

- - i — n l 

e. In subparagraph £ above, it was determined that the "Jr and "V" 
rows were equivalent," and^ that the "D" and "N" roys„also formed an equiv- 
alent pair." The next ‘"heavy" row is the "G" row, this is to be tested 
for match with the five remaining unmatched rows . Let t$ie ”"G" row be 
tested first against the "B" row Inese two rows are given below, with 
their cross-products sum. For convenience, the cross-products sum is 
symbolized by ^(©^-,©2) , where ©1 and ©2 represent the designators of the 
distributions to be ''matched 9 



"G" 

HjjH 

*(G,B) 



222-3--1-1 
31113 22121 
622-3--i-i = 15 



- ji 



The complete table pf the comparisons of the "G" row with the five avail- 
able lows is as follows 



%(G,B) 

*(G,J) 

*(G,Q) 

2(G,S) 

*(G,X) 



6 2 2 
2 2 2 

- 4 - 
2 4 4 

- 2 - 



3 

3 

3 

6 

6 



l 

l 



1 

l 



1 z 



15 
11 
7 
17 
= 8 



The results indicate that the most probable match with the "G" row is the 
"S" row. 

f . Since the next "heaviest" row to be tested is the "B" row, its • 
matchings with the .three remaining rows are made, and are given below 



X(B,J) 3111124121 = 17 

X (B,0) -2- 2‘ 122-21= 12 

^(B,X) - 3 -1222-4 - = 12 



«•!*» i' 

_ «* 



» ' j 
* 



+*i 

*v. 



7? 1 - 

u In this conneotion, nole Lhe considerations treated m subpar. 60^. 

^ The Gicek 3etlei X (chi) is often used m cryptoDogy to symbolize 



matching operations 
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The concct matching of the B and J rows is indicated by the results* 
This 3 caves on]y tlie "Q" and "X" rows, which sic presumed to go together, 
since noi only is their cross-products sum satisfactory (when compared to 
the % values Tor some of the a the: rows which have been matched), but, 
equally important , thcii patterns oi crests and troughs are similar. 

Since we have not found more than two rows for any one set of inter- 
changeable values, It appears that the original matt ax had only five 
rows , with tiro variants for each row'. The rows of the distribution dia- 



combined 


m 1 


the following 1 




C 


F 


11 


K 


M 


P 


R 


T 


VI 


Z 


BJ 


4 


2 


2 


2 


2 


3 


4 


2 


3 


2 


• -BN 


8 


2 


8 


7 


2 


2 


2 


5 


7 


5 


GS 


3 


)| 


U 


- 


5 


1 


- 


1 


- 


2 


LV 


2 


8 


1 


7 


7 


8 


9 


6 


7 


7 


QX 


- 


3 


- 


_3_ 


3 


2 


2 


- 


3 


- 



Figure ^3 



g. Analysis of the distributions of the columns of Fig. 43 quickly 
reveals that columns "C" and "H" may be matched as a pair, and likewise 
columns "F" and "M", and columns "P" and "R" . In order to decide the 
groupings of the remaining columns, the six possible ^ values are der- 
ived. 



X(K,T)« 


4 


35 


— U2 — z 


81 


X (K,W)- 


4 


49 


- 49 9 = 


113 


£(k,z) 


4 


35 


- 49 - = 


88 


*(*,») 


6 


35 


- 42 - = 


83 ’ 




4 


25 


2 42 - = 


73 


J(w,z) 


6 


35 


- 49 - = 


90 



Combinations 
KT, UZ 81+90 
TZ* 113 
TU 88 



Ktf. 



KZ, 



+ 73 
+ 83 



171 

186 

171 



It appears that the proper pairings of the columns are 
and "Z" 






and "W", "T" 



t 



^ h. The groupings of' the cblumns having been determined, the fre- 
quency diagram is reduced to its basic 5x5 square, and the <b test is 

H e “ r *" * ■* t »■*•«.* i 



C F K‘ P 
H M W R 



$ 

Z 



BJ 


6 4 5 7 


DR 


16 4 ill k 10 


GS 


7 9-13 


u. LV 


3 15 i4 17 13 


, 0 X 


- 6 6_4 - 


r 





<^=1962 

<^=1132 

<j> o :l670 



taken as further statistical assurance of the, matchings . Although <|> 0 in 
this case does not come up to the best expectations , we feel nevertheless 
that the matching has been carefully and correctly accomplished, and so 
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tlie next step is continued, with a conversion of the multiliteral text 
into uniliLeral equivalents, ■using the following reduction square con- 
taining on arbitrary sequence 





C 


F 


K 


P 


T 




H 


I! 


\1 


R 


Z 


BJ 


A 


B 


C 


D 


E 


BN 


F 


G 


H 


I 


K 


GS 


L 


M 


IT 


0 


P 


LV 


Q 


R 


S 


T 


U 


Q X 


V 


W 


X 


Y 


Z 



The converted cryptogram is now easily solved, using the principles set 
forth in Section VI . The first fifteen letters of the plaintext message 
are found to read "IJEATHDR FORECAST . and the original enciphering 

matrix is recovered, based on the key word ATMOSPHERIC, as follows 





P 


F 


C 


K 


T 




R 


M 


H 


W 


Z 


LV 


A 


T 


II 


0 


S 


DN 


P 


II 


E 


R 


I 


BJ 


C 


B 


D 


F 


G 


GS 


K 


L 


N 


0 


U 


OX 


V 


W 


X 


Y 


z 



i. The method of matching rows and columns just described in the 
preceding subparagraphs applies equally well to all the matrices in Figs . 
26-35, and similar variations If in the process of equating indicators 
the cryptanalyst sees that the row Indicators are falling into the same 
groupings as the column indicators, he might be able to accelerate the 
equating process by taking advantage of this feature alone, as would be 
the case if he had encountered a cryptogram involving a matrix with indi- 
cators arranged m a manner similar to that shown m Figs 29 and 30. 
Furthermore, a cryptogram enciphered m a commutative system, wherein 
the equivalents have boon taken m row-column and column-row order indis- 
criminately, may be recognized as such through a study of the digraphic 
distribution or the cryptogram since the " " row of the distribution < 

will have an appearance similar to the " " column, the " ft 11 row vill f 

he similm to the " P " column, etc, 1 - 0 this matter is discussed further in 
subpor 6ld. , r 



10 It is oiten convenient to use aibitrary symbols in cryptanalytic 
woik, to pi event confusion with designations of. p.cLpol elements of plain 
text, cipher text, or key (see footnote 1 on page 58 ). For this purpose 
Greek letters aro often used, for reference, the 2k letters of the Greek 

k. * * 

alphabet and their names are appended in the chait below 

~ ij - J — t #■ 



A 


<* 


alpha 


E 


E 


eps lion 


I 


1, iota 


N -v 


nu 


P 




ro 


$ 


t 


phi 


13 


P 


beta 


Z 


c 


^ela 


K 


n kappa 


x £ 


XI 


£ 


<r 


"ijiiia 


X 


X 


chi 


r 




gormu 


H 


•n 


cL 1 


A 


^ lajobd 1 


O O 


omicron 


T 


X 


tau 


Y 


i 


psi 


A 


_J_ 


fc LI 1 


G 


9 


the La 


A\ 


l*. mu 


n -rr 


P 3 


T 


V 


■»u. j Lon 


CL 


CO 


omega 
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j . 3 l * s impoi t ml to point out that in matching , 1 he cryptanalyst 
shoixld begin with the "best" rows or columns — best not only irom the 
standpoint of "heaviness" of the distribution, but also best from the 
point of view 5 t a distinctive pattern of crests and troughs. If insuf- 
ficient text is available bo allow equating all the intei changeable 
coo* dinn trs of a particular enciphering matrix, it may still bo possible 
that a convex sion of ine cipher text by meant of a partially-reduced re- 
cons b uct ion 1 matrix may yield enough ldioniorphic pot terns and other data 
to make pos*&ible an entry into the text. If the cryptographer hos not 
used a "check -off" process in enciphering, but instead has favored certain 
equivalents for the various plaintext letters, matching may not be pos- 
sible, nevertheless, an entry into the text may be facilitated in this 
case, because some oi the resultant peaks in the cipher text may bo cor- 
rectly identified Furthermore, since no variant system can possibly 
disguise the letters of low frequency in plain text, their low-frequency 
equivalents in the cipher text may provide possible approaches to solu- 
tion (See also subpar . 6 le) . 

k. In addition bo the method oi solution by matching and combining 
rows and columns of a diginphic distribution of a mul tiliteral cipher, 
theie is also the general approach applicable without exception to any 
variant system. This method, involving the correlation of cipher elements 
suspected to be the equivalents of specific but unknown plaintext letters, 
is treated in detail m paragraphs 6 l and 62. 

l . Systems such ns the ll-le-vel dmome cipher illustrated in Fig. 36 

are susceptible to a very easy solution, if the dinomes have been in- 
scribed in numerical order •'dioatod. Assuming such a case m a 

specific cryptogram, the first six groups ■> 1 ’ which are 

» 

68321 09022 *i 8057 65111 88648 42036 .. 



a four-part frequency distribution of the entire message, is taken as 
illustrated in Fig. 4 ^ below 



01 02 03 04 % 06 07 08 09 10 H ,2 13 14 15 lb 17 18 19 20 21 8 23 24 25 
26 27 28 29 30 31 5 33 3 ^ 35 5 37 38 39 to 4 l to 43 44 45 46 ^7 l i8 49 fo 



51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 1 68 69 70 71 72 73 75 



76 77 78 79 80 81 82 83 Si % 86 87 88 89 90 91 92 93 9^ 95 96 Iff 98 99 00 

Figure kk . 



RESTRICTED 



135 



I 




REF ID:A56892 






REOTinCTED 

i If the student will bring to bear upon thi^ problem the principles he 
learned m Section V of this text, he Kill soon realize that what he now 
/ has before him are four simple , mono alphabetic frequency distributions 
similar to those involved in a monoalphabet ic substitution cipher using 
i standard alphabets . The realization of this fact/ immediately provides 
the clue to the next step* "fitting each of the/distributions to the 
normal". (See par. 3l) • This cgm be done without difficulty in this 
case (remembering that a ^-letter alphabet is /involved and assuming that 
I and J are combined) and the following alpho.be ts result 



01— I-J 


2.0— U 


51— N 


76— E 


02— K 


27— V 


52—0 


77— F „ 


03— L 


28— W 


53— P 


78-R3" 


04— M 


29— X 


54— Q 


79— H 


05— N 


30— Y 


55— R 


80— I-J 


06—0 


31— Z 


56— S 


81— K 


07— P 


32— A 


57— T 


82— L 


OS— Q 


33— B 


58— U 


83— M 


09— R 


34— C 


59— V 


84— N 


10— S 


35— D 


60— W 


85—0 


11— T 


36— E 


61— X 


86— P 


12— U 


37— F 


62— Y 


87— Q 


13— V 


38— G 


63— Z 


88— R 


14— W 


39— H 


64— A 


89— S 


15— X 


40— I-J 


65— B 


90— T 


16— Y 


41— K 


66— C 


91— U 


17 — Z 


4 2— L 


67— D 


92— V 


18— A 


43 — M 


68— E 


93 — W 


19— B 


44— N 


69— F 


94— X 


20— C 


45—0 


70— G 


95— Y 


21— D 


46— P 


71— H 


96— Z 


22 — E 


47— Q 


72— I-J 


97— A 


23— F 


48— R 


73— K 


98— B 


24— G 


49— S 


74— L 


99— C 


25— H 


50— T 


75— M 


00— D 



The key word is seen to be JUNE and the beginning of the cryptogram is 
deciphered as "EASTERN EETRAECE " 

m. If instead of 25 -element alphabets, e system such as that in 
Fig. 37 has been used, onl/ a slight modification of the procedure in 
subpaiagiaph j_ would have been necessary, i.e., the distributions would 
have had to be consideied on a basis of 2 6 , and the process of fitting 
the distributions to the normal would have gone on as m the previous 
example . 
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n. One Ciu Lher application of principles learned 3 n Section V de- 
serves to "be mentioned here, in connection with the solution of system 0 
3 uch as those of Fig. 36. Let the following short message he considered 

h 8 2 2 6 8 8h 2 0 9 9 U 3 Go* 7 6 0~5~9 cf 9 6 5 i 

36683 52267 9 7 11 ii 5 b k 6 6 76 

t - * - - - * ’ » 1 *; , 

If it is known that the correspondents have been using lx variant system 

such as that in Fig. 36, a special solution may "be employed in those cases 
wherein there is insufficient cipher text to permit analysis "by the method 
oi fitting the frequency distribution to the normal. Thus, a short cryp- 
togram may be solved by a variation o C the plain-component completion 
method described in par. 34 First, let the cryptogram bo copied in 
dinomes, with an indication of the level (i.e., the "alphabet") the di- 
nonle would occupy m the 1 1 — level matrix, thus 



t<8 22 68 84 23 52 09 99 36 04 76 05 90 5 ^ 51 36 68 35 22 67 97 11 45 4 >i 66 76 
2 13 4 13 1 4 2 1 “5 14 3 3 2 3 2 1 3 4 1 2 2 3 ' 4 

■-« x . r * 

The dinomes belonging to the four levels are as follows 

r — ‘ ^ 1 

(1) 22 23 09 04 05 22 11 

(2) kQ 36 36 35 45 41* 

(3) 68 52 56 51 68 67 66 

( 4 ) 84 99 76 90 97 76 ’ 

“ ■* 



f h 



t * •* 

» r ' 



t - 



1 4 „T 

These dinomes are converted ^into terms of the plain component by setting 
each of the cipher sequences against the 'plain component at an arbitrary 
point of coincidence, such^s in the following' example 

^ - / r k ~ r ‘ * 

ABCDEFGHal-JKLMNOPQrBrSTUYWXTZ 



01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 
26 27 28 29 30 31 32 33 34 35 36 37 38 39 ll 0 4 i 42 43 4 ?i 45 46 I17 *18 49 50 
51 52 53 9 b 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 
76 77 78 79 80 81 82 83_84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 00 



(1) 


22z\T, 


23 =x. 


09 = 1 , 


(2) 


48 =X, 


36 =l, 


36:L, 


(3) 


68:S, 


52=B, 


56zF, 


(4 


84:1, 

LI 


9 9=Y, 


J6=A, 




» e- 4 , 


f ^ “ 





OhzD, 


05 =E, 


22 rW , 


U=L 


35 =K, 


^ 5 =U, 


44 :T 




53 =A, 


68:S, 


67 =R, 


66 


90 =P, 


97 =U, 


76 :A 





f e 



■*r - 



«> 



r 



1 t 



1 - 1 - It should be clear to the student that the reason this method can be 
applied in this instance is that both the plain component (ABC.... Z) and 
the cipher co^onent ( 01 , 02 , 03 ..... 29 , 26 - 90 , 91 - 79 , 76-OO) are knotm 
sequences (or thus assumed) . 
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o. The plain component sequence is now completed on the letters of 



the four 


levels , 


as 


follows • 














1 


- 








>• 




f 




1st level 






2d level 




3d level 

4 


w 


4th level 


W 


X 


IDE 


W 


L 


X 


L 


L 


K 


U T 


S 


B 


F 


A 


s 


R 


Q 


I 


Y 


A P W 


A 


X 


Y K E F 


X 


M 


Y 


M M L 


V 


U 


T 


C 


G 


B 


T 


S 


R 


K 


Z 


B Q X 


B 


Y 


Z 


X, F G 


Y 


N 


Z 


N 


N 


M W 


V 


U 


D 


H 


C 


U T 


S 


L 


A 


CRY 


C 


Z 


A 


M G H 


Z 


0 


A 


0 


0 


N 


X 


w 


V 


? 


I 


D 


V 


U 


T 


M B 


D S Z 


D 


A 


B 


N H I 


A 


P 


B 


P 


P 


0 


Y 


X 


W 


F 


K E 


U 


V 


U 


N 


C 


E T A E 


B 


C 


OIKB 


Q 


C 


Q Q 


P 


Z 


Y 


X 


G 


L 


F 


X W V 


0 


D 


F U B 


F 


Q 


D 


P K L 


C 


R 


D 


R 


R 


Q 


A 


Z 


Y 


H 


M 


G 


Y 


X R 


P 


E 


G V C 


G 


D 


E 


QLM 


D 


S 


E 


S 


S 


R 


B 


A 


Z 


I 


N 


H 


Z 


Y X 


Q F 


H W D 


H 


E 


F 


R M If E 


T 


F 


T 


T 


S 


C 


B 


A 


K 


0 


I 


A 


Z 


Y 


R 


G 


I X E 


I 


F 


G 


SNO 


F 


U 


G 


u, 


,u 


T 


D 


C 


B 


L 


P 


IC 


B 


A 


Z 


S 


H K Y F 


K 


G 


STOP 


G 


V 


H V 


V 


U 


E 


D 


C 


M 


0 


L 


C 


B 


A 


T 


I 


L Z G 


L 


H 


I 


UPQ 


H W 


I 


V 


w 


V 


F 


E 


D 


N 


R 


M 


D 


C 


B 


U 


K 


M A H M 


I 


K 


V Q R 


I 


X 


K 


X 


X 


w 


G 


F 


E 


0 


S 


N 


E 


D 


C 


V 


L 


HBI 


N 


K 


L W R S 


K 


Y 


L 


Y 


Y 


X 


H 


G 


F 


P 


T 


0 


F 


E 


D 


w 


M 


OCK 


0 


L 


M 


X S T 


L 


Z 


M 


Z 


Z 


Y 


I 


n _ 


G 


0 


U 


P 


G 


F 


E 


Y 


N 


PPL 


P 


M N 


Y T U 


M 


A 


N 


A 


A 


z 


K 


1 


II 


R 


V 


Q 


H 


G 


F 


Y 


0 


Q E M 


Q 


N 


0 


Z U V 


N 


B 


0 


B 


B 


A 


L 


K 


I 


S 


u 


R 


I 


H 


G 


Z 


P 


R F N 


R 


0 


P 


A V W 


0 


C 


P 


C 


C 


B 


M 


L 


K 


T 


X 


S 


K 


I 


H 


A 


Q 


S G 0 


B 


P 


Q B W X 


P 


D 


0 


D 


D 


C 


N 


M 


L 


U 


Y T 


L 


K 


I 


B 


RTHP 


T 


Q 


R 


C X Y 


Q 


E 


R 


E 


E 


D 


0 


N 


M 


V 


Z 


U 


M 


L 


K 


C 


S 


U I Q U 


R 


S 


D Y Z 


R 


F 


S 


F 


F 


E 


P 


0 


If W 


A 


V 


N 


M 


L 


D 


T 


V K R V 


S 


T 


E Z A 


s. 


.G 


_ ,T, 


,G 


G 


F 


Q 


P 


0 


X 


B 


vr 


0 


N 


M 


E 


U 


W L S 


W 


T 


U 


FAB 


T 


H 


U 


H 


H 


G 


R 


0 


P 


Y 


C 


X 


P 


0 


N 


F 


V 


X M T 


X 


U 


V* 


G B C 


U 


I 


v; 


'i 


I R 


S 


R 


Q 


Z 


D 


Y 


Q 


P 


0 


G W 


Y N U 


Y 


V 


w 


H C D 


V 


K 


w 


K 


K 


I 


T 


S 


R 


A E 


z 


R 


Q 


P 


H X 


Z 0 V 


Z 



It is seen that the Generatrices with the best assortment- 1 ^ of high- 
frequency letters for the four levels are 

1st level 2d level 3d level 4th level 

BIHim'5 RE. E DON Ej)SNEDC HCETAL 



In evaluating generatrices, the sun of the arithmetical frequencies 
of the letters in each row may he used as an indication of their relative 
"goodness". A statistical] \ much more accurate method ol evaluating 
generatrices invqlves the use of logarithms of the probabilities of the 
plaintext letters forming uhe ge^eraLt ices . (bee also footnote 7 on 
page 89 ) - ' * 
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If tlie lettcis of these Generali Ices aie an anged xn the oidcx of appear- 
ance of tlieir dinomc equivalents, according to I he way Ihey fall into the 
various levels, 

U8 22 68 8ji 23 52 09 99 36 oh 76 03 90 56 51 36 68 35 22 67 97 11 jj£ 4j) 66 76 

E I< R fl N L T 

R E ED OH 

E 0 SHED C 

N - C E T A E 

the plain text "REEHFORCEMEWTS HEEDED AT ONCE” is clearly seen. Or, ihore 
simply, IT we examine the equivalent s of 01, 2 6, 51, and 76 after the 
generatrix determination has been made, the key word JURE is levealed. 

If an error had been made m the selection of a generatrix, the error 
could he resolved hy hypothesizing the probable key word, or by decipher- 
ing the text on the basas of the assumed diagiam and then noting and 
degarbling the systematic errors (which, it would be noticed, all come 
from one level) 

p. The student should note that no one generatrix will yield plain 
text all the way across as in the example in par. 3^ Instead, the 
generatrices must be considered separately for the four levels, since it 
is within each of the four levels that there is a homogeneous relation- 
ship of dinojnes ^ Obviously if dinomes from more than one level were used 
to complete the plain component sequence, the generatrices would not con- 
sist of a homogeneous group of letters but instead would represent an 
assortment of letters' from tiro or more "alphabets" 

6l. Analysis of moie complicated examples . — -a As soon as a begin- 

ner in cryptography realizes the consequences of the fact that letters 
are used with greatly varying frequencies m normal plain text, a bril- 
liant idea very speedily comes to him Why not disguise the natural fre- 
quencies of letters by a system of substitution, using many equivalents, 
and let the numbers of equivalents assigned to the various letters be 
more or less m direct piopoi_lio.fi lo tne noimal frequencies of the let- 
ters’ Let E, for example, have 13 equivalents, T, 9> H, 8, etc., and 
thus (he thinks) the enemy cryptanalyst can have nothing in the iray of 
telltale or character is tic fi equencies to use as an entering wedge 

b. If the ted available for study ip sm ill in amount and if the 
variant values are wholly independent of one aftother, the problem can 
become exceedingly difficult. But m practical military communications 
such methods are rarely encountered, because the volume ol text is usually 
great enough to permit of the establishment of equivalent values . To 
illustrate what is meant, suppose a number of cryptograms produced by 
the monoalphabetic -variant method described above show the following 
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two sets of gi oupinco’'3 or cipliei elements in the text. Set "A" being 
assumed to be diffeient representations of one particular underlying 
plain text, and Sol "B" ass tuned to be representations of another under- 
lying plain text 

Set 11 A” Seb "B” 



(12-37-02-79-60-13-03-37-7 r) 
(82-69-02-79-13-68-23-37-35) 
(82-69-53 -16-13-13-78 05-35) 
(91-05-02-01-68 -42-78-37-77) 



(71-12-02-51-23-05-77) 
(11-02-51-O2-O3-O5-35) 
( 11-91-02-02-23-37-35 ) 
( 9 1 -12-51-02-78 -69-77 ) 



An examination of these groupings would lead to the following tentative 
conclusions with legard to piobable equivalc its 



(12, 82, 91) (02, 51) (13, *£, 68) (35, 77) 

(05, 37, 69) (01, 16, 79) (03, 23, 78) (13, 71, 97) 



The establishment of these equivalencies would sooner or 3 ater lead to 
the finding of additional sets of equal values. The completeness with 
which this can be accomplished wilY determine the ease or difficulty of 
solution Of course, if many equivalencies can be established the prob- 
lem can then be reduced practically to monoalphabebic terms and a speedy 
solution can be attained 



c. Theoretically, the determination of equivalencies may seem to be 
quite*" an easy mebter, but practically it may ~be very difficult, because 
the cryptanalyst can nevei be certain that a combination showing what 
may appear to be a variant va3ue is leally such and does not represent 
a part of a differenb plamteH sequence. For example, tale blie groups — 



17 -82-31-82- ill -63, and 
• ' -* - 27-82-Jl0-82-l4-63 

Here one might suspect tnot 37_and 2 J represent the same letter, 33. and 
ho another letter. ^But it .happens i^hat one group represents the word 
MANAGE, the other DAMAGE . , Thei e are^ hundreds of such cases in English 
and in other languages . ** * 

~ \ v f + c 



d. When reversible combinations 01 e used as variants, the problem 
is perhaps a bib more simple Tor example, using the accompanying Fig. U 5 

t > »» * 

> - +*** 



F, 7 

G, J 
C,N 
P,T 

Figuie I 15 



K,Z 0,Y B,H II,R D,L 
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^■8 qq 1G p lej t ctuoenc mi~ht be able to delei”uuc 
te' t of the two bc J s o' cijhei text gi oupir; o , 
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foz enc tplierment, Iwo messages with the same initial words, REFERENCE 
YOUR, may "be enciphered as follows 

R S ' F fi ' *5‘ " E H C ' Y 0 if R 

(1) N II U D R XLSHC D W U Z N RSL1P SRB JC H 

(2) CHDVR XS LEU Dff ZB H R L S II P RVJBS H_ 

The experienced cryptanalyst, noting the appearance of the very first few 
cipher groups, assumes that not only have the messages identical beginn- 
ings m their plain texts, but also that he is here confronted with a 
variant system involving biliteral reversible equivalents One of the 
manifestations of puch! a cryptosystem is that in the d5 graphic distribu- 
tion ol the cipher text the ”B" row will have an appearance similar to 
the "B" column, the h C" row will resemble the f ’C" column^ etc . , thus , 
the cryptanalyst will almost immediotely realize that fie has encountered 
a commutative system involving a matux smaller than that indicated by 
the size of matrix necessary for making the digraphic distribution . 

* 1 ~ ~ * 1 *1 ~ 
e The probable-word method of solution may be used, but v with a 

slight variation introduced because of the fact that, regardless of the 
system, letters of low frequency in plain text remain infrequent in the 
eiyptogram . Hence, suppose a word containing low-frequency letters, but 
m itself a rather common word strikingly idiomorphic in character is 
sought as a "probable word" , for example, words such as CAVALRY, ATT ACK, 
and PREPARE, Such a word may be written 6n a slip of paper and slid one 
interval at a time under the te db, which has been marked so that the 
high- and low-frequency characters are indicated. Each coincidence of a 
low-frequency letter of the text ./with a ^ow- frequency letter of the 
assumed word is examined carefully to see whether the adjacent 'text let- 
ters correspond in frequency 'with the other letters of the assumed word; 
or, if the latter presents repetitions, whether there are correspondences 
between repetitions m the cipher text and those m the word. Many trials 
axe necessary but this method will produce results" when the difficulties 
are otherwise too much for the eryptana] yst to overcome , 

------ „ 

62. Analysis involving the use of j sologs T--a. In military communi- 
cations it is not unusual that cryptograms are produced containing identi- 
cal plain text but which have been subjected to different cryptographic 
treatment, thus yielding different cipher texts . This difference in cryp- 
tographic treatment may be caused by the use of an entirely different 
general system, or by the use of a different specific key, or merely by 
the choice of equivalents m a variant system. Messages which present 
different encrypted texts but which contain identical plain text are 
called isologs (fiom the Greek iso = "equal" and logos = "word"). One of 
the easily-noted indications of the possible presence of isologs is 
equality or near-equality in the lengths of two (or more) cryptograms. 
Isologs, no matter how the cryptographic treatment varies, are among the 
most powerful media available to the cryptanalyst for the successful 
solution of a difficult cryptosystem — and, in some cases, may provide the 



l4l 



RESTRICTED 




REF ID : A56892 



»■» 

RESTRICTED 



only possible entries 3”3to a complex cryptosystem. An inkling of the 
help afforded by isologs was revealed by the example contained in sui^ar. 
6 Id above; however, a much more striking illustration is given in the 
next few subparagraphs . 

b. The following two cryptograms, suspected to be isologs, are 
available for study . « 



8 2 2 6 5 
8 0 2 7 7 

63629 

8 17 13 
38728 
89697 
2 8 12 0 
90870 

46729 



3 0 15 0 
45647 

90628 . 



3 5 19 9 

3 8 4 6 3 

26121 

0 6 4 8 l* 

4 4 10 5 

59682 





Message "A" 






63103 


7 4 8 3 9 


69842 


32529 


70115 


89106 


9 4 0 0 0 


13828 


5 4 0 8 2 


40065 


33918 


4 3 15 8 


8 l 0 4 8 


2 6 4 5 8 


4 5 0 3 9 


52538 


7 3 3 0 9 


20749 


6 17 5 2 


16476 


91147 


9 9 9 2 6 


4 1 4 6 8 


1 3 3 6 5 


3 3 8 8 1 


93816 


5 17 5 0 


5 7 0 7 4 


1 1 8 0 4 


4 3 2 5 5 


27730 


3 119 9 


7 9 9 6 2 


27865 


60653 


40867 
36245 - 


4 6 5 9 4 


,1 9 8 5 5 
* 


10822 


22987 


f 


t 


V 




► 


- - - 


Message B 


< 


- 


► V 1 

87497 


14 5 11 


97360 


49676 


50106 


99181 


69672 


53889 


41563 


25203 


77536 


2 0 3 5 1 


10570 


89277 


7 5 0 1 1 


90138 


99974 


50232 


0 4 1 1 5 


89216 


17547 


1 4 6 4 8 


0 0 6 4 6 


85864 


5 3 8 9 8 


83878 


94889 


3 3 7 2 8 


11272 


20504 


32103 


9 8 7 1 5 


4 2 6 6 2 


80760 


89880 


52900 

46253 


5 9 7 2 8 


22855 


8 7 3 0 0 


70893 



On the possibility that some dinome system (or systems) is involved, the 
messages are written under each other m dinomes to facilitate the exam- 
ination of the similarities and differences of such a grouping of the 
cipher texts, as shown on the next page 
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A 


02 


26 


56 


31 


03 


74 


03 


96 


98 


42 


32 


52 


97 


01 


15 


A* 


30 


15 


08 


7>l 


97 


i4 

-1 


51 


19 


73 


60 


49 


67 


65 


01 


06 


B 


80 


2T 


78 


91 


06 


94 I 


00 


01 


38 


28 


54 


08 


24 


00 


65 


B’ 


1| 5 


64 


79 


91 


81 


691 

1 


67 


25 


38 


89 


4] 


56 


32 


52 


03 - 


C 


63 


62 


93 


39 


18 


43 


15 


88 


10 


*48 


26 


45 


84 


50 


39 


C‘ 


90 


'62 


87 


75 


36 


20 


35 


11 


05 


70 


89 


27 


77 


50 


11 


D 


81 


71 


35 


25 


38 


73 


30 


92 


07 


49 


61 


75 


21 


64 


76 


D* 

% 


35 


19 


99 


01 


38 


99 


97 


>15 


02 


32 


o4 


11 


58 


92 


16 

a 1 


E 


38 


72 


89 


ll 


47, 


99 


92 


64 


14 


68 


13 


36 


53 


38 


81 


r ' E 1 


38 


46 


31 


75 


47 


l4 


6)1 


80 


06 


46 

v 


85 


86 

» 


45 


38 


98 

i 


F 


89 


69 


79 


38 


16 


51 


75 


05 


70 


74 


11 


80 


44 


32 


55 


F» 


26 


12 


18 


38 


78 


94 


88 


93 


37 


28 


11 


27 


22 


05 


o4 


- G 


28 


12 


02 


77 


30 


31 


19 


97 


99 


62 


27 


86 


56 


06 


53 


- G* 


06 


ll8 


43 


21 


03 


98 


V 


54 


26 


6 ? 


80 


76 


08 


98 


80 


* 
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98 


55 


10 


82 


22 


29 


87 


H’ 


l|>t- 


10 


55 


29 


00 


59 


72 


82 


28 


55 


87 


30 


07 


08 


93 


J 


46 


72 


93 


62 


45 
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- i 






1 J 


J* 


59 


68 


24 


62 


53 
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The dxnome distributions for the two messages are as follows • 





Distribution for 
Message "A" 



Distribution for 

Message "B” 

* 
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c. Since a gcneial absence of marked crests and troughs is no bod in 
"both 'distributions, if the divis.on of those cryplogiams into dinomes ,q.s 
corxcct, and if they are both monoalphabet ic, xt is quite probable bha-6' 
some typo of valiant system (ox systems) has been used. With this in 
mind, the encrypted te^Ls and their distributions ore scrutinized further 
foi some indication oC the kind of relationship which exists between the 
methods of enciphorment of the two messages. The distributions arc seen 
to be strikingly similar, not only ■with respect to the location of the 
one predominant peak m each, but also m the close correlation of the 
locations of the blanks in each Furthermore, upon examination of the 
superimposed messages themsetveo, it is observed that there are several 
instances wherein a value m message "A" coincides with the same value 
in message "B" (e.g , see positions A/A ' 1.4, b/b' 9 ) This observation, 
taken in conjunction with the marked similarity of the distributions, 
strongly indicates that not only has the same general cryptosystem been 
used for the encryption of both messages, but that the same enciphering 
m atrix has-been used for both. Also, in the case of the values 3$ and 
T>2., it is noted that wherever either occurs in one message the same va3ue 



( For the benefit of the student with a Mathematical background, it 
might be interesting to point out certain applications of cryptomathe- 
matics in connection vith these two distribut j ons . First of all, each of 
the two distributions is much flatter than that which would be expected 
for a sample of 125 du names of random text, i.e., a drawing (with replace- 
ment) and recording from an urn containing equal numbers of counters m 
each of 100 categories labeled 00-99 consecutively. In other words, 
whereas "random" follows a characteristic distributional appearance, ap- 
proximated by the normal or binomial distributions, the samples at hand 
exhibit phenomena even flatter (or "worse") than that expected for ran- 
dom, approaching the theoretical (and fantastically non-random) "equi- 
librium" of exactly the same number of tallies m each cell of a distri- 
bution. The following table gives the observed number of x-fold repe- 
titions in the two distributions, together with the expected number of 
x-fold repetitions in a sample of like size of random text, which expected 
number has been computed from tables of the Poisson exponential distri- 
bution (see MjlHary Cryptanalysis, Part III) 



X 


Observed 
Ilsg. "A" 


Observed 
Msg. "B" 


Expected 


71 


l4 


37 


29 


1 


51 


52 


36 


2 


33 


23 


22 


3 


1 


6 


9 


Ij 


- 


1 


3 


5 


] 


1 


1 



It is to be noted that m the distribution for Iiessage "A" the observed 
number of blanks (l^i) against the expected number cf blanJ s from random 
text (29) leprcsenlb a signnge or standard dev 10 1 ion of 2 er , which 
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occurs In the olhci message, a phenomenon explainable on the ssauitiplion 
thal the plaintext equivalents of the bgvo lues ore or such low frequency 
that no variant values have been provided for these plaintext letters m 
the ci yptosystem. *■ ' 1 

d. With the foregoing details determined, it is now icalircd that 
it should he possible to form, between the tiro mess aces, "chains" ot 
those cipher values which represent identic ol plaintext letters, os 
exempl if rod below. Beginning with the first value m each message , 82 
and 30, a partial chain of equivalent "variants is started, now locating 
some other occurrence of cither value elsewhere (e g., 82 at position 11*8), 
and noting the cipher value coinciding with it (in this case, hi) , the 
pari lal chain maybe extended (including now 82, 30, ond 4] ) , After this 
particular chain is extended to include as many values as possible, 
another chain is formed by starting with any value which hap not already 
been included in the preceding chain, this procedure being repeated until 



can be translated as odds of 368 to 1 against its occurrence by pure 
chance. Likewise the other entries besides 0 (m particular, the x-values 
of 1 and 2, and the cumulativp valu,es of 3 -end-better) may be evaluated 
m terms of sigmages, and the conclusion would be reached that the two 
distributions have a most remote chance of being as flat as they are 
through mere chance"^ for instance, it is 3*05 o' or 877 to 1 against dis- 
tribution ,f A ft having pnly two "tallies occurring three or rnSre times when 
13 such" tallies are expected' by iandom^— and this sigmage when taken Into 
consideration with that of the number of blanks yields a sigmage of 4 ©- 
or approximately 31,000 to 1 of occurring through" sheer chance. The sum 
total of all the deviations could be collectively evaluated, but this 
trould involve the laborious computation of a multinomial distribution. 
Since the distributions of the two messages are much worse than would 
even be expected for random chance, the conclusion is drawn that the 
dmome grouping is highly significant and therefore must be correct, and 
furthermore that the cryptosystem involves* variants m sufficient numbers 
for the plaintext letters to permit the_ encipherer to select the cipher 
equivalents with a v;lcw to suppressing as much of the phenomena of repe- 
tition as possible Secondly, the x tost of the" two distributions gives 
a X value of 206, as against the x value of 156 f or_ random samples of 
this size, this represents -a sigmage of_4.02 cr , or a ratio of 33,000 to 
1 against its happening by pure chance, i.e., if the cryptograms were not 
m the same general system and specific keys . Therefore it is a foregone 
conclusion statistically that not only ^do the cryptosystems involve di- 
nomes as the ciphertext grouping, but that the identical cryptosystem is 
involved m the two messages, and that because of, the close correlation 
o J f the patterns of the two distributions, there is a good probability 
that the cryptograms contain identical plain text and therefore are iso- 
logs This specific illustration of the potentialities of cryptomathe- 
matics indicates the important role that this branch of scjence may play 
in the art of cryptanalysis . 
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all possible chains are completed. It is found that the following chains, 
arbitrarily arranged hoze according to length, may be derived from the 
two messages 

(0 6 14 15 2 6 28 31 35 73 7 1 ! 8i 89 98 99) “ j - 

(02 07 20 22 43 44 6^ 90) 

(12 37 48 51 69 70 83 9 J 0 
(03 30 4l 54 65 82 97) 

(05 10 24 32 49 87 93) 

(16 18 38 76 78 79 86) 

(27 45 53 64 80 92) - - * 

(11 39 75 88 ) 

(21 58 77 84) 

(46 59 68 72) 

(00 52 67) 

(04 55 61) 

(08 29 56) 

(19 71 96) ... 

(01 25) Single dinomes 

(13 85) , - 

(42 60) (38) (47) (50) (62) (91) 



If we now make an arbitrary assignment of a different letter to represent 
each chain (and each single dinome) and convert either of the messages to 
uniliteral terms by means of these arbitrarily-assigned values, we note 
the pattern of the opeping stereotype "REFERENCE YOUR MESSAGE.....", and 
quickly recover the plain text. 

e. The plaintext values when inserted into a 10x10 matrix having 
arbitrarily-arranged coordinates yield the following. 

' iHn, - -r' 

0123 4 56789 

0 

1 

2 

"3 

4 

5 

6 

♦ 

7 

8 

_ 9 

Manipulating the,, rows and columns ^ ith a view to uncovering some symmetry 
or systematic phenomena, the latent diagonal pattern of the^equivalents 

*- 1 — l ^ r 
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for certain" of the betters (such as Ep, rip, Op, Rp, end Op) d s revealed, 
and tho^roi/s 'and columns of the reconstruction djogrTm arc permuted to 
yield the' foil owJ ng original enciphering matiix 

- 5 - ‘ > ‘ +t v -* » ' - 
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J - _ 1 
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- f* f J* • 1 7 
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~ ■» , 1 .* j 5.^6 , 
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V 


Y 


Z 



' •* ' , - p - a 1 ' * 7* - * , i >. 

There are^onq /jbserv-ible relationships in or between the sequences of. 
digits jji "thq row and coluitjp. coordinates , therefore for want of any 
visible phenomena or further information on the derivation (if any) of 
these digits, it. is assumed^ tjtfit they must have been assigned at random. 
The student ■trill note that t]?e final matrix is identical to that of 

Figure 39 in paragraph 59 • ' 

— ■» 

f . It should be emphasised that in the example of the preceding 
subparagraphs it was only possibleJLo form chains of values from both 
messages reciprocally because the same’ encipnermg matrix had been used 
for both. A non-reciprocal chaining procedure would have been required 
if only the general system had been the same for both but the enciphering 
matrices had differed in some respect, or if two completely different 
variant systems had been useci (eg, one using a frequential matrix and 
the other involving a less "6omplex type of variant matrix, such as Fig. 
29 ). Specifically, it would' have been necessary to maintain two sepa- 
rate groups of chains, one 'group for, each message, otherwise heterogeneous 
values would have become intermingled. 1 

* - - x ~ * r 

g. Although an analysis of but one isolated example by means of 
isologs was presented, the stqdent should b k e_able to appreciate the 
significance and potentially enormous value o T isologs to a cryptanalyst. 
This value goes far beyond the simple variant encryption in a monoalpha- 
betic substitution system, isologs produced by the use of tiro different 
code books, or two different enciphered code versions of the same under- 
lying plain text, or tiro encryptions of identical p]ain text; by tiro dif- 
ferent "settings" of a cipher machine, may all prove of inestimable value 
J.n the attack on a difficult cryptosystem 
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63 . Further remarks on variant systems . — a. A few yords should he 
added wit F rcgarct to certain subtei fuges Which" ore sometimes encounters^ 
In monoalpha.bSi.lc substitution with variants, and 'which, if not recog-~ fl 
rfized in time, cause considerable delays The considerations treated t 
before in subpars. 52 jL and ^ on the disguise of the length of the basics , 
imiltillteral group apply equally here to multiliteral substitution with 
variants, thus, in dmomc systems, a sum-checking digit or a null might 
be added in specified positions of the group to form a tnnome. In 1X0 ■ 
complex valiant systems, the presence of a null as one of the digits of 
a tnnome would add greatly to the complexities of cryptanalysis of that 
system. The most important of the subterfuges have to deal with the use 
of nulls which are of a different size than the real cryptographic units, 
inserted occasionally to prevent the cryptanalyst from breaking up the 
text into its proper units The student should take careful note of the 
last phrase, the mere insertion of symbols having the same characteristics 
as the symbols of the cryptographic text, except that they have no mean- 
ing, is not what is meant Jfhis class of nulls rarely achieves the pur- 
pose intended. Uhat is replly meant - can best be explained Jby an example. 
Suppose that a 5 x 5 variant matrix vith the row* an£L column "indicators 
shown in Fig. 4 6 is adopted for enciphemejit . formally, the cipher units 
would consist of 2 -letter combinations of the indicators, invariably 
giving the row indicator first (by agreement). 1 ' 
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TiGuie !i6 

The phrase COIMAHOiiR OF SPECIAL TROOPfb might be enciphered thus* 



COMMA I< DEROF 
VI EB PH IU FT IT AB Til UO P5I GT 



see 
e • 4 



These would normally then le anonged m 5 -tetter groups, thus 

v i e n p 11 1 u p t lriBi ii'ior’i g t . . . 
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It (l i ill l>o uo+ril* however, Ihit only 20 oi Tie 2^ It. I ter , of the 
oDplnlx t Ivfto lwii employed as n.sr md rolumn indicator.,, lnvin r J, Y , 

0, \, \ , nwl * upm e<l Pew, ''uppo , 'e 11k c r i lot lor ore used as r>u3 1*3 , 
notjtu p"Ul s » } ui a j ~ Inti njidnal letter irijCiLed it rarclcn juit before 
the ~i3 ail angeG m 5-lettci gioups Occasionally, *t pair of 

lettcva wotJhl "bo ( ni r, ci Lo<3,_ In or tie ’T to mask the charocteiistics of 
"avoidance” of llic e lelteis foi each other. Thus, for example. 

V I E X B I'HKIU F J X T I E A J B T MU OOP tJGKTjf 

Tile cryptanalys l , a^tcr some study suspecting abilitcrRl cipher, pro- 
ceeds to break up LI jo lex l into pairs 

VI EX BP Hh H) FJ XT IE AJ BT Ml 0Q I’ll GK TY 

Compare this set or 2~lctter u combinations with the correct set. Only 4 
of the 15 pairs ore "propel " uni ts It is easy to see tlial without a 
knowledge of the eristcnce of Ibe nulls — and even with a knowledge, if 
he does not lor «w which letters are nulls — the cryptanalyst would be con- 
fronted with a pioblem for the solution of which a fairly large amount of 
text might be necessary The corcful^emp] oyment oi the variants also 
very materially adds to the security of the method because repetitions 
can be rather effectively suppressed 

c. Similarly in the o amplcs undci paragraph 58, the letter J in 
Figs. 27 and 29 may be used as a null, the letter Y in Fig. 28, and the 
digit 0 in Figs 33 and 3 1 * In Fag 30, any letters m the range of 

P - Z might be used as nulls, out this U3a'e might be weak because of the 
extremely loir frequency of these letters as compared with the letters 
A - 0, this is an important point to consider in the examination of en- 
crypted text for possible poor ussges oT nulls 

d. From the cryptographic standpoint, usage of nulls m the manner 
outlined above result., in cr/pto u npnic Lc d even more than twice as long 
as the plain text, 1 lius constituting a serious disadvantage. From the 
cryptanalytic standpoint-, Ihe m -in of the cipher units in the system 
described in subper b above constitute.. the most important obstacle to 
solution, this, coupled nlh tne u ■> of variants, makes this system con- 
siderably more difficult to sot\e, espile its monoa] phobe ticity . 
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SECTION IX 

POLYGRAPHIC SUBSTITUTION SYSTEMS 



Paragraph 

General remarks on polygraphic substitution 614 - 

Polygraphic substitution methods employing large tables 65 

Polygraphic substitution methods employing small matrices. 66 

Methods for recognizing polygraphic substitution..,.. 67 

General procedure in the identification and analysis of polygraphic 

substitution ciphers 68 



69 

70 

Analysis of Plhyfair cipher systems 71 



Analysis of four-square matrix systems 
Analysis of two-square matrix systems, 



Analysis of pojLygraphic systems involving large tables, 



tut ion systems 
sisting of sink 
tinction betwe 2 
elements constjt- 
volving single 
involving ciphfe 
multiliteral .1 
''triliterai'* , 
that the 



72 



Further remark? on polygraphic substitution systems 73 

6 k. General remarks on polygraphic substitution . — a. The substi- 



plaintext 



dealt with thus far have involved plaintext units con- 
le elements (usually single letters). The major dis- 
n them has been made simply on the basis of the number of 
tuting the ciphertext units of each, i.e., those in- 
|-element ciphertext units were termed uni literal , and those 
rtext units composed of two or more elements were termed 
That is to say, when the terras "unillteral" , "biliteral" , 
Jste., were used, it was to have been automatically inferred 
units were composed of single elements. 



b. This 
involving plat 
systems 



are te 
terms "unillte 
rectly so — tha 
the compos it io 
term containing 



section of the text will deal with substitution systems 
otext units composed of more than one element; such 
rmed polygraphic , ^ 



(By comparing this new term with the 

jral" and "multiliteral" it may then be deduced — and cor- 
a term involving the suffix "-literal" is descriptive of 
|n of the cipher text units of a cryptosystem, and that a 
the suffix "-graphic" describes the composition of the 



See also 



subpar. 52a. 



Systems 
on this basis, 
connection with 
units of a syst 
unless otherwis 
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involving plaintext units composed of single elements may, 
be termed monographic; however, as has been stated in 
the terms "uniliteral" and "multiliteral", the plaintext 
em are understood (without restatement) to be monographic 
e specified. 
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p ] aintexfc units. ) Polygrapnic systems in vhj ch the plaintext units are 
cor posed of two elements axe called digi aphlc , these in which the plain- 
text units are composed of three elements ere trlgraphic „ etc. The 
ci p hertext units of polygraphia systems uBually"t onsist of the same 
number of elements as the plaintext units. l'hua, if a system is called, 
"di graphic", it may ho assumed that the ciphertext units of the system 
consist of two elements , as do the plaintext units; if this were not the 
case, the tern ’’digraph.! c" "by Itself would not he adequate to describe 
the system completely, and an additional modifying word or phrase would 
have to he used to Indicate this feet. 5 

jc» In polygraphic substitution, the combinations of elements which 
constitute the plaintext units are considered as indivisible compounds. 
The units axe composite in character and the individual elements compos- 
ing the units affect the equivalent cipher units jointly , rather than 
separately. The basic important factor in true polygraphic substitution 
is That all the letters of each plaintext unit participate m the de- 
termination of its cipher equivalent, the identity of each element of 
the plaintext unit affects the compos it ion of the whole cipher unib.o 
Thus, iD a certain digraphic system, ABp may he enciphered as XP C> and 
AL'p, on the other hand, may he enciphered as NK C ; a difference in the 
identity of hut one of the letters of the plaintext pair here produces a 
difference in the identity of both letters of the cipher pair.' 



In this connection, it is further pointed out that since the root 
"literal" derives from the Latin "liter a", it is conventionally prefixed 
by modifiers of Latin origin, 3uch as "uni-", "hi-”, and "multi-”; simi- 
larly, "graphic”, deriving from the C-reek "graphikos", is prefixed by 
modifiers of Greek origin, such as "mono-", "di-", and "poly-". 

^ The qualifying adverb "usually" is employed because this corre- 
spondence is not essential. For example, if one should draw up a "sot of 
6(6 arbitrary single signs, it would be possible to represent the 2- 
lebter pairs from AA to ZZ by single symbols. This would still be a di- 
grapnic system. 

5 See subparo . 6 5 a and 66f for examples of two such systems and their 
names. " " ^ 

^ An analogy is found Jn chemistry, when two elements combine to form 
a molecule, the latter usually having properties quite, different from 
those of either of the constituent elements. For example, sodium, a 
metal, and chlorine, a gas, combine to form sodium chloride, common 
table salt, Howe/er, sodium and fluorine, also a gas similar in many 
respects to chlorine, combine to foru sodium fluoride, which is much 
different from table salt. 

f * 

^ For this reason the two letters arc marked by a ligature, that Is, 
by a bar across tneir tops. Jn cryptologic notation, the symbol GQ p 
itk.ans "any pluirlext digraph", the symbol 63 c , "any ciphertext digraph". 
To refer speeix >cally tc the 1st, 2-j., 3d,. . member of a ligature, the^ 
exponent 1, 2, will be used. Thus 8^ of REMp is the letter 33, 8^ 

of XRZ C is Z. See also footnote 1 on jag 0 ‘ ^ 
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d. The fundamental purpose of polygraphic substitution is again 
the suppression or the elimination of the frequency characteristics of 
single letters of plain text, just as is the case in monoalphabetic 
substitution with variants; hut here this is accomplished by a different 
method, the latter arising from a somewhat different approach to the 
problem involved in producing cryptographic security. When the substi- 
tution involves replacement of single letters in a monoalphabet ic system, 
even a single cryptogram can be solved rather readily; basically the 
reason for this is that the principles of frequency and the laws of 
probability, applied to individual units (single letters) of the plain 
text, have a very good opportunity to manifest themselves. However, 
when the substitution Involves replacement of plaintext units composed of 
two or more letters — that is, when the substitution is polygraphic in 
nature — the principles of frequency and laws of probability have a much 
lesser opportunity to manifest themselves . If the substitution is di- 
graphic, then the units are pairs of letters and the normal frequencies 
of plaintext digraphs become of first consideration; if the substitution 
is trigraphic, the units are sets of three letters and the normal fre- 
quencies of plaintext trigraphs are involved. In these cases the data 
that can her employed in the solution are meager; that la why, generally 
speaking, the adlution of polygraphic substitution ciphers is often ex- 
tremely difficult. 

e. By way of example, a given plaintext message of say n letters, 
enciphered by means of a uniliteral substitution system, affords n 
cipher characters, and the same number of cipher units. The same message, 
enciphered digraphically, still affords n cipher characters but only -g- 

cipher units. Statistically speaking, the sample to which the laws of 
probability now are to be applied has been cut in half. Furthermore , 
from the point of view of frequency;} the very noticeable diversity in 
the frequencies of individual letters; leading to the marked crests and 
tioughs of the uniliteral frequency distribution, is no longer so 
strikingly in evidence in the frequencies of digraphs. Therefore, al- 
though digraphlc encipherment, for example, simply cuts the cryptographic 
textual units in half, the number of Cipher units which must be identi- 
fied has been squared , and the difficulty of solution is not merely 
doubled but, if a matter of Judgment arising from practical experience 
can be expressed or approximated, mathematical] y, squared or cubed. 

f . The following two paragraphs will treat various polygraphic 
substitution methods . The most practical of these methods are digraphic 
in character and for this reason their treatment herein will be more de- 
tailed than that of trigraphic methods. 

65 . Polygraphic substitution methods employing large tables . — 

a. The simplest method of effecting polygraphic substitution in- 
volves’ the use of tables similar to that shown in Figure 4 Ya. This 
table merely provides equivalents for digraphs, by means of the coordi- 
nate system. Specifically, in obtaining the cipher equivalent of any 
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Figure 47a. 



plaintext digraph, the initial letter of the plaintext digraph i*s usea 
to indicate the row in which the equivalent is found, and the* final 
letter of the plaint art digioph indicate the column, the cipher digraph 
ia then fouid at the intersection of the row and column thus indicated . 
For example, K}pri’C c , , etc. 

b. In the preceding table two uixed sequences were employed to 
form the cipher equivalents, one seciUence being based on the key phrase 
ireSTHJGIiOirc.il UP BRAKE and the other cm GENERAL E1ECTRIC C015PAHY. The 
table in Figure 47a could have been clx’Lwn up in a slightly different 
manner, as shown in Figure 47b, and ’+110 yield the same cipher equiva- 
lents as before. Using this latter tauli, 9* for any plaintext digraph 

is found at the Intersection of Lho : or and column identified by &h and 

o 0 

0^, respectively, 8 C is foiu'd in the sequence below the table and is 

* 
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Figure hjb. 
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taken from the position directly under the column identified by 6^. A 

few sample encipherments will illustrate that this table is cryptographi- 
cally equivalent to that of Fig. 47a. 



c. Figures 48 and 49, below, contain other possible types of tables 
for dTgraphic substitution. In Fig. 48, it will be seen that there are 
two vertical sequences to the left of this table and no horizontal 
sequence below it. ©^ is located in the leftmost sequence, 6“ being 

O 

found directly to its side in the right-hand sequence, ©„ is then found 
at the intersection of the row and column identified by 0p and 0p, 
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Figure 48. 

respectively. The table in Fig. 49 provides digraphic equivalents by 
rpano of the coordinate system (e.g., RCp=JZ c ), in the same manner as In 
Fig. 47a, end a cursory examination of tne inside of the table might 
disclose nothing new about tha s table at ell. But, if one wore to scan 
closely the diagonals formed by each 9^ from upper right to lower left, 

he would soe that each such diagonal changes below the "Mp row”, simi- 
larly, if the diagonals formed by 6^ are scanned fiom upper lefb to 

lower light, it will be seen that each of them also changes after the 
"Up iow u . In effect, the inside of the table is divided into tvo sepa- 
rate poj fcions by an imaginary line extending horizontally between the 
II and U rows, but within eoch portion a straightforward type of symmetry 
is exhibited and the scon* two mixed sequences have been employed in each. 
Actually, in a 26x26 table, it is not possible to maintain the diagonals 
foiired thus by ©J aJl4 9 c a completely “unbroken” sequence without 

pioduclng repeated digrapns within the table and \rithout consequent 
cryptographic ambiguity, time, Fig. 49 illustrates one type of limited 
d->ngona3 symmetry whici im<s+ be leeorted to in the systematic con- 
struction of sues, a tab3e. 
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d. All of the foregoing tables have exhibited a symmetry in the 
arrangement of their contents, which is undesirable from the standpoint 
of cryptographic security. This systematic internal arrangement could 
be detected by a cryptanalyst early in his attack on cryptograms pro- 
duced through their use, permitting rapid reconstruction of the particular 
table involved, this subject will be given a more detailed treatment in 
par. 72. The table in Figure 50 is an example of one type of table which 
would provide more security than the foregoing. This table is con- 
structed by random assignment of values and shows no symmetry whatsoever 
in its arrangement of contents . It will be noted that this table is 
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Figure 49. 
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(Showing only a partially Ailed table) 
Final Letter (fl 2 D ) 
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Figure 50 * 

reciprocal in nature, that is AFp=YG c and TCp=AF c . Thus, this single 
table serves for deciphering as well as for enciphering. Reciprocity 
is, however, not an essential factor, in fact, greater security is pro- 
vided by non -reciprocal tables. But, in the case of such non-reciprocal 
randomly constructed tables, each enciphering table must have its comple 
mentary deciphering table. 

e. Digraphic tables employing numerical equivalents instead of 
letter equivalents may be encountered. However, since 676 equivalents 
are required (there being 676, or 26x26, different pairs of letters), 
this means that combinations of three figures must be vised, such systems 
are termed tr inome -digraphic systems, indicating clearly the number of 
elements which comprise the cipher units* By way of an example, the 
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folio-wing figure contains a fragment of a table 0 which provides trlnome 
equivalents for the plaintext digraphs: 



A 

B 

C 



ABODE 
001 002 003 004 005 
027 028 029 030 031 
053 054 



Y 

Z 



625 626 

651 652 



_Y Z 

025 026 
051 052 



649 650 

675 676 



Figure 51. 

f . All of the foregoing tables have been digraphic in nature, but 
a kind of false trigraphic substitution may also be accomplished by means 
of similar tables, as Illustrated in Figure 52, wherein- the table is the 
same as that in Figure 49 with the addition of one more sequence at the 
top of the table. In using this table, is located in sequence I, and 

J r 



It is interesting to note that this comparatively bulky and unwieldy 
table can be reduced to the following two alphabets with numerical 
equivalents for the letters: 

(1) ABCDEF XYZ 

000 026 052 078 io4 130 5980624 650 

(2) A B C D E F XYZ 

1 2 3 4 5 6 24 25 26 

In enciphering, the first letter of the plaintext digraph is converted 
into its numerical value from alphabet (l), and the second plaintext 
letter is converted by means of alphabet (2) , the two numerical values 
thus derived are added together, and their sum is taken as the cipher 
equivalent of the particular plaintext digraph. Of course, this simple 
reduction would not be possible if the trinomes, in ascending order, had 
been arranged in the table in, say, a diagonal manner. 
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III ABCDEFGHIJKLMNOPQRSTUVW^YZ 
IV RADIOCPTNFMEBGHJKLQSUVWXYZ 
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Figure 52. 

its equivalent, 0 C , taken from sequence II; 8^ is located in sequence 

III, and its equivalent, 0® taken from sequence 17; 0^ is the letter 

lying at the intersection of the row indicated hy in sequence I and 

the column determined hy 9p, Thus, FIRE LUJES would he enciphered NNZ 

IEQ KOV. Various other agreements may he made with respect to the 
alphabets in which each plaintext letter will he sought in such a table, 
hut the basic cryptographic principles are the same as in the case 
described. 

g. Tables such as those illustrated in Figs. 47-52, above, have 
been encountered in operational systems, but their use has not been vary 
widespread because of their relatively large size and the inconvenience 
in their production and handling. In lieu of these large tables it is 
possible to employ much smaller matrices or geometrical designs to ac- 
complish digraphic substitution, methods involving their use will be 
discussed in the following paragraph. 
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66 Polygiophic substitution methods employing omal] matrices .9 — 
a. A simple method for accompli shine digiaphlc substitution in- 
volves the use of the four -square matrix , n matrix consisting of four 
5r5 squares in which the letters of a 25 -element alphabet (combining X 
and J) are inserted in any prearranged order. When four such squares 
ore arranged in 0 matrix as shown in Figure 53 > the latter may he em- 
ployed for digraphic substitution to yield the same cipher results as 
does a much larger table of the type ireated m the preceding paragraph. 

1 2 

In a four-square matrix, 0_ of 99-, is sought in section 1, 9 , in section 

1 2 V 1 P 

2. Thus, 0p and 0p will always form the northwest-southeast corners of 

an imaginary rectangle delimited by these two letters as located m 

1 2 

those two sections of the square. Then 9 C and 9 C are, respectively, the 

letters at the northeast-southwest corners of this" same rectangle. Thus, 

TG =XS . UD =GH , OR j tffi =XB i etc. In decrypting, 0^ and 0!* ore 
p c' p c p c p c 7 c c 

sought in sections 3 and h t respectively, and their equivalents, and 

2 * 

6p, noted in sections 1 end 2, respectively. 
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Figure 53. 

b. It is possible to effect digraphic substitution with a matrix 
consisting of but two sections by a modification m the method of find- 
ing equivalents. In a horizontal two -square matrix , such as that shown 

,1 2 

m Figure 5^ > 0p of e©p is located in the square at the left, 0p, in the 

square at the right. 



o 

y The word matrix as emp] oyed in this paragraph refers to checker- 
board-type diagiams smaller than the tables illustrated in the preceding 
paragraph. These matrices are usually composed of sections containing 

25 cells each. 
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Figme 5h 



©2gl 

°p D c 






1 



and 0p are at tile opposite ends of the diagonal of an imaginary 



rectangle defined "by these ] etters , the ciphertext equivalent comprises 
the two letters appearing at the opposite ends of the other diagonal of 

_ — " r* fs 

the same rectangle, 0", is the particular one which is in the same row a3 

12 p — _ 

0p, and 0 C is the one in the same row as 0^. For example, Alp=TT c , 

DOp^GA^. . When 0p and 9p happen to he m the same row, the ciphertext 

_ * } 

equivalent is merely the reverse of the plaintext digraph, for example, 
ATpSFAc and EHpSHE c . ' J - 

c. Digraphic substitution may also he effected hy means of ver tical 
two -square matrices , in which one section Is directly above the other, as 
in Figure 55 > it will he noted that wati icesjof this type have a feature 
of x ec iproe ity when employed according to "the usual rules, which follow. 
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Figure 55. 
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When 6p and 9 p are at the opposite ends of a diagonal, the rule for en- 
cipherment is the same as that for horizontal two-square encipherment 
(e.g., i^p=UA c and UAp=MO c ) ; when both 6 p and 6 p happen to be in the same 
column, the plaintext digraphs are self -enciphered, (e.g., MApzMAg and 
ELpSEL,,), a fact which constitutes an important weakness of this method. 1- ® 
This disadvantage is only slightly less obvious in the preceding case of 

horizontal two-square methods wherein the cipher equivalent of 69 p con- 
sists merely of the plaintext letters in reversed order. 

d. One -square digraphic methods, with a necessary modification of 
the method for finding equivalents, are also possible. The first of 
this type to appear as a practical military system was that known as the 
11 

Playfair cipher . It was used for a number of years as a field cipher 
* by the British Army, before and during World War I, and for a short time, 
also during that war, by certain units of the American Expeditionary 
Forces. Figure $6 shows a typical Playfair square. The modification in 
the method of finding cipher equivalents has been found useful in 
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Figure 56, 

imparting a greater degree of security than that afforded in the preced- 
ing small matrix methods . The usual method of encipherment can be best 
explained by examples given under four categories: 



10 See subpar. 73 b on other enciphering conventions which remove this 
weakness . 

11 This cipher was really invented by Sir Charles Wheatstone but re- 
ceives its name from Lord Playfair, who apparently was its sponsor before 
the British Foreign Office. See Wemyss Reid, Memoirs of Lyon Playfair , 
London, 1899 . It is of interest to note that, to students of electrical 
engineering, Wheatstone is generally not known for his contributions to 
cryptography hut is famed for something he did not invent — the so-called 
Wheatstone bridge”, really invented by Samuel H. Christie. 
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(1) Members of the plaintext pair, 0 p and 0 p , are at opposite ends 

of the diagonal of an imaginary rectangle defined "by the two letters, 

1 2 

the members of the ciphertext pair, and ©£, are at the opposite ends 
of the other diagonal of this imaginary rectangle. Examples* MOp-ALg, 
Mip=UC c , m p =Qj^, VI p =YC c . 

(2) 9^ and 0 2 are in the same row, the letter immediately to the 

T* 7 p 

right of 0p forms the letter immediately to the right of 0* forms 
0£. When either 9 p or 0 p is at the extreme right of the row, the first 
letter in the row "becomes itB cipher equivalent. Examples* M^ p rM c , 
Hj p =AF Q i AFpSKMcJ FAp=M c . 

(3) ©i and 0 2 ore in the same column, the letter immediately below 

11 9 2 1 

0p foims 0 C , the letter immediately below 0 p forms 0£. When either © p 

or 0 p is at the bottom of the column, the top letter in that column be- 
comes its cipher equivalent. Examples MC p =CB c , AU p iTA c , WA p =AT c , 

, 2 

(4) 0 p and ©p are identical, they are to he separated by inserting 
a null, usually the letter X or Q, and subsequently enciphered by the 
pertinent rule from above. For example, the vord BATTLES would he en- 
eiplieied thus* 

BA TX TL ES 
DM W CO KP 

The Playfair square is automatically reciprocal so far as encipherments 
of type (l) above are concerned, hut this is not true of encipherments 
of type (2) and (3) . 

e. It is not essential that the small matrices used Tor digrophic 
substitution be in the shape of perfect squares, rectangular designs will 

12 

serve equally well, with little or no modification in procedure. For 
example, each section of, say, a four-square matrix could be constructed 
with four rows containing six letters each by having U p serve for V p , os 

well as T p for J p , Furthermore, it is possible to expand the sections of 

a digraphic matrix to 28, 3©, cu more characters by the following subter- 
fuge, without introducing digits or symbols into the cipher text .-*-3 one 



12 Ilowover, because the terms 11 four - s quo re matrix", "two-s quare matrix", 
and "Playfair square " have become firmly fi v ed in cryptologic literature 
and practice, they continue to be applied no all such matrices, even 
when the "squares" of such matrices do not contain on equal number of 
rows and columns (that is, even when, they are not square). 

"*3 The addition of any symbols such as the digits 1, 2, 3 ,».* i n ^o a 
matric solely to augment the number of elements to 27, 28, 3©, 32, or 36 
characters would not be considered piacticable, since such a p_oceJuie 
would result In producing cryptograms containing inter’ 'ixtuj.es oi lextoi-o 
°nd figures. 

l64 
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of the letters of the alphabet may be omitted from the sot of 2 6 lettei's, 
and this letter may then be replaced by 2, 3, or more pairs of letters, 
each pair having as one of its members the omilted singDe letter. The 
5x6 Playfair square of Figure 57a has been derived thus, the letter K has 
been omitted as a single letter, and the number of characters in the 
rectangle has been made a total of 30 by the addition of five combi- 
nations of K vith other letters . An interesting consequence of this 
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Figure 57a. 

modification is that certain irregularities are introduced in any crypto- 
gram produced through its use, for example, (l) occasionally a plaintext 
digraph is replaced by ciphertext trigraph or tetragraph, such as 

Mp =SkU e and EP p =KEK0 c ; and (2) variant values may appear— BKE e , DKE C , 
KEP C , GP C , and TP C all may be used to represent CK^. As far as the 

deciphering is concerned, there is no difficulty because any K occurring 
in the cipher text is considered as invariably forming a ligature with 
the succeeding letter, taking the pair of letters as a unit; and, when a 
plaintext unit is obtained containing on e of the K-pairs, the letter 
after the K is disregarded, for example, CK0 p is read as CK. The four- 
square matrix in Fig. 57b has also been constructed using the foregoing 
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Figure 57b. 
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subterfuge. With this latter matrix, numbers in the plain text maybe 
enciphered, still without producing cipher text containing numbers, for 
example, the plain text "HILL 3506" would be represented by the cipher 
QAB AT KUKI NQE, which would be regrouped into groups of five letters 
and sent as QABAT KUKEN QE... 



f . Figure 58 shows a numerical four-square matrix which presents a 
rather interesting feature in that it makes possible the substitution of 
3-figure combinations for digraphs in a unique manner. To encipher a 
message one proceeds os usual to find the numeiical equivalents of a pair, 
and then these numbers are added together Thus 
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Figure 58. 

In deciphering, the greatest multiple of 25 contained in the group of 
three digits is determined, then this multiple and its remainder are used 
to form the elements for determining the plaintext pair in the usual 
manner. Thus, 284=275 4 9=PR« 

g. Thus far all the small-matrix methods have involved only di- 
graphic substitution. The two matrices together illustrated in Figures 
59a and b may be used to provide a system for encipherment which 1 b 
partly trigraphic, the adverb "partly" has been used because this par- 
ticulai system will yield trigraphic encipherment approximately 88.5$ of 
the time in ordinary text and digraphic encipherment appioxlmotely 11.5$ 
of the time. 11 *' In this cese the cipher equivalents of the trigraphs 



^ These figures are based on the number of trigraphs ending in one of 
the 15 highest-fi equency letters (ETNRO AISDLHCFFU ) , and on the number of 
tiigiaphs ending with other letters. 
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Fig. 59b. 



(or digraphs, as the case may he) are tetranomes. Encipherment is "best 
illustrated hy an example, this is given in the next subparagraph. 

h. Let the text to he enciphered he a message beginning with the 
words "REFERRING TO YOUR MESSAGE NUMBER FIVE STOP ..." This is rewritten 
into trigraphs, with the proviso that the third letter of the trigraph he 
one of the letters contained in the small square in Fig. 59b, if the 
third letter is not one of these 15 letters, the plaintext grouping is 
left as a digraph, then the grouping into trigraphs (or digraphs) con- 
tinues . Thus , the foregoing plain text would he written as follows * 

REF ERR IN- GTO YOU RME SSA GEN UM- BER FI- VES TOP ... 

In encipherment, it is to he noticed that Rp occurs four times in section 
1 (as do all the letters) and Ep occurs four times in section 2j the 
proper combination of the l6 possibilities is determined hy the coordi - 
nates of the third letter of the trigraph as indicated in the small 
square, Fig. 59b. Since the coordinates of Fp in this square are 42, 
then it is the 4th occurrence of Rp in section 1 and the 2d occurrence 

of Ep in section 2 which are used to obtain the equivalent for the tri- 
graph REF-rj, this equivalent is 1905* When the plaintext unit as obtain- 

* 1 
ed above is only a digraph, it is the 1st occurrence of 9^ whj ch Is used 

o * 

in section 1 and the 1st occurrence of ©p which is used in section 2; 

thus, "IN-" from the sample message beginning, above, would be enciphered 
2828. The encipherment of the plaintext example above is then 

REF ERR IN- GTO YOU RME SSA GEN UM- BER FI- VES TOP 

1905 4o8l 2828 4719 0727 1372 74i7 4n8 2270 3807 4024 8806 8623 
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Figure 59a* 
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The cipher text could then he transmitted in groups of four digits, or, 
as a subterfuge to conceal the basic group length, the transmission 
could be in five -digit groups. In decipherment, the ciphertext tetra- 
nome is deciphered in the manner of the -usual four-square matrix, and 

1 2 

the location of the particular values for 0 p and © p will indicate the 
identity of the third plaintext letter, if any. 

jL. Now that the student has become familiar with the details of 
typical polygraphic substitution systems, he Is ready to continue his 
cryptanalytic s tudy with the treatment of methods for recognizing poly- 
giaphic substitution, these methods are described in the next paragraph. 

67. Methods for recognizing polygraphic substitution . — a. The 
methods used to determine whether a given cryptogram is digraphic In 
character are usually rather simple. If there are many repetitions In 
a cryptogram or a set of cryptograms and vet the uniliteral frequency 
disti lbution gives no cleai-cul Indications of monoalphabeticity, if 
most of the repetitions contain an even number of letters and these 
repetitions for the most part begin on he odd letters and end on the 
even letters of the message, yet the cipher text does not yield to so- 
lution os a biliteral cipher when the procedures outlined in Sections 
VII and VIII are applied to it, if the cryptograms usually contain an 
even number of letters (exclusive of nulls), and if the ciphei text Is 
in letters and all 26 letters are not piesent and J or U are among the 
absent letters (or 3f the cipher is in digits and there is a limitation 
in the range of the text when divided into tnnomes, this range usually 
being not greater than OOI-676), then the encipherment may be assumed to 
be digraphic In nature. 

b. Although the foregoing general remarks arc true as far as they 
go, occasionally they may be difficult to apply with any clcar-cut 
1 csults unless a large volume of cipher text is available for study. To 
supplement them there arc statistical tests which may he applied foi the 
recognition of digraphic substitution. Just as the <(> tust and the A 
test may be applied to the uniliteral distribution of ja ciyptogron to 
help determine iThether it is monoalphabetic. with respect to single- 
letter plaintext units, so may these same, tests be applied to the di- 
' r ranhic distribution of a cryptogram for the purpose of do Lei mining 
whether the cryptogiam in question is nrano alphabetic when considered os 
a dd graphic cipher* _ _ ' " 

c The basac form of the <j> test is the same when' applied to di- 
giaphic distributions as when applied to monographic — thac is, uni- 
liteial — distributions (see pnr. 27). it is only the plain and random 
constants that change, and "N" In the formulas now pertains to the number 
of digiophs under consideration. Instead of the number of single letters. 

■ rT 1 ^ •** ~ 
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To illus trate this, the formulas for computing the "digraphic phi plain 
(&l)" and the "digraphic phi random (<j>~)" are shown below* 

Jr * 

^ = .0069 N(N-l) 

$ 2 = .0015 N(K-l) i 

The "digraphic phi observed (<^)" is caJLculated in the usual manner, that 
is, by multiplying each f ("which in this case is found in one of the 
cells of a digraphic distribution) by f-1 , and then totalling all fche 
values thus derived. 

t 

d. The A 2 test (or the "digraphic blank-expectation test") may be 
applied to a digraphic distribution just as easily as its monographic 
counterpart is applied to a uniliteral frequency distribution. For this 
purpose. Chart 8 is given below, showing the average number of blanks 
theoretically expected in dlgraphic distributions for plain text and for 
random text containing various numbers of digraphs (up to 200 digraphs). 
As can he seen, the chart contains two curves . The one labeled P applies 
to the average number of blanks theoretically expected in digraphic 
distributions based upon normal plaintext messages containing the indi- 
cated number of digraphs. The other curve, labeled R, applies to the 
average number of blanks theoretically expected in digraphic distri- 

- ' ' ' a- 

-- - - - - ----- - I " 

13 ~ . 

The digraphic plain constant, . 0069 , was obtained by summing the 
squares of the probabilities of digraphs in English plain text; the 
digraphic random constant, .0015, is merely the decimal equivalent of 
1/676. Further elaboration on the use of these constants, among others, 
will be given in Military Cryptanalysis, Part II. 
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Chari 8. 

"but Ions based upon perfectly random assortments of digraphs. In using 
this chart one finds the point of intersection of the vertical coordi- 
nate corresponding to the number of digiaphs in the message, vith the 
horizontal coordinate corresponding to the observed nuraber of blanks in 
the dlgrnphic distribution for the message. If this point of inter- 
section falls closer to curve P than it does to curve R, this is evidence 
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that the cryptogram is digraphic in nature^; if it falls closer to 
curve R than to curve P, this is evidence that the cryptogram is not 
digraphic in character. 



e. Although it may not he necessary to resort to the use of the 

P **" * P 

cj) and A test to determine vhethei* or not a particular cryptogram has 
heen digraphically enciphered, it is veil to know the application of 
these tests, since use has heen made of them in difficult cases in 
operational practice. They may he helpfully employed in cases where the 
cryptanalyst is uncertain as to whether or not a single null has heen 
added at the hegiiming of a cryptogram suspected to he a digraphic 
cipher; and these tests may also he found useful in the analysis of com- 
plex cases where the digraphic encipherment has heen applied, not to 
adjacent letters of the plaintext message, hut to digraphs composed of 
more-or-less separated letters in the message. Elaborations of these 
ideas will he treated in Military Cryptanalysis, Part IX. 



f . As for the recognition of trigraphic substitution ciphers — if 
most of the repetitions are a multiple of three letters in length, if 
these repetitions for the most part begin (when the cipher text is 
divided into trigraphs) with the first letters and end with the third 
letters of the trigraphs, and if the length of the cryptograms is for 
the 4 most part a multiple of three letters , yet the cipher text does not 
yield to solution as a triliteral cipher, then the encipherment may he 
assumed to he trigraphic in nature* 

g . Just as the test may he used as an aid in the recognition of 
digraphicity, it may theoretically he used for recognizing the tri- 
graphic, tetragraphic , etc., nature of cryptograms, hut its use for 
these latter purposes is much more limited because of the “large amount 
of text which would he required to permit a valid application of the 
pertinent polygraphic <{> test. 



-- *“ v V- f •< n. 1 •* — . ,5“ ’ 1 r 

68 . General procedure in the identification and analysis of poly - 



graphic substitution ciphers . — A. Certain systems which at first glance 
seem to he polygraphic, in that groupings of plaintext letters are 
treated as units, are on closer inspection seen to he only partly poly- 
graphic in character. Such is true of systems involving large tables of 
the type illustrated in Figs. 47a and h, and 48 (in par, 65 ,' above), 



— 

Unfortunately, such would also he the case if the cryptogram under 
consideration were a polyalphabet ic cipher Involving two alphabets. 
However, to distinguish between a digraphic cipher and a polyalphabetic 
cipher with two alphabets, a digraphic distribution could he made "off 
the cut", that is, made of those ciphertext digraphs which are formed by 
omitting the first letter of text and then dividing the remaining text 
into groups of two letters. If the system were digraphic, such a distri 

p 

bution would exhibit a poor <|r j if the system were a two-alphabet substi 

p 0 

tution system, the <f > 0 would he as satisfactory as that of the regular 



distribution, taken "on the cut" . 
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vhoiexn one ipliermerrt is by pairs but one of the letters in eoch pair to 
enciphered monoalphabet ically, making these systems only pseudo -poly- 
grophic . For example, using the table in Figure ^8, any plaintext di- 
graph beginning with "A” must be enciphered by a cipheitext digraph 
beginning with "W " ; any plaintext digraph beginning with "B" must be 
enciphered by a ciphertext digraph beginning with "E"; etc. A crypto- 
gi am involving the use of this table may then be identified as such 
merely from a study of the uniliteral frequency distribution mode on the 
initial letters of the cipher digraphs, since such a distribution would 
perforce be monoalphabetic .17 

b. In certain other systems — namely, the four-square, two -square, 
and Playfair square systems of par. 66, above — the method of encipherment 
is by pairs, b\it the encipherments of the left-hand and right-hand 
members of the pairs show group relationships; this is not pseudo- 
polygraphic but, rather, partially -polygraphic . Cryptograms enciphered 
by means of systems of this latter type may not be readily identified as 
such merely througn an examination of their cipher text, but their so- 
lution may be effected rather rapidly as soon as a few correct plaintext 
assumptions have been made therein. A more detailed treatment of this 
matter will be given in succeeding paragraphs of this section. 

c. The analysis of cryptograms which have been produced by di- 
graphic substitution is accomplished largely by the application of the 

T ft 

simple principles of frequency of digraphs, with the additional aid of 
digraphic idiomorphs and such special circumstances as may be known to 
or suspected by the cryptanalyst. The latter refer to peculiarities 
which may be the result of the particular method employed in obtaining 
the equivalents of the plaintext digraphs in the encrypting process, 
such as those mentioned in subpars. a and b, above. In general, if there 
is sufficient text to disclose the normal phenomena of repetition and 
idiomorphism, or if cribs are available to be used as an entering wedge, 
solution will be feasible. The foregoing general statements will be 
expanded upon in the following two subparagraphs, d and e. 

d. Phen a digraphic system is employed in regular service, there 
is little doubt that traffic wil 1 rapidly accumulate to an amount more 
than sufficient to permit of solution by simple principles of frequency. 
Sometimes only two or three long messages, or a half -dozen of average 
length, are sufficient. For with the identification of only a few cipher 



37 

1 For this purpose, the simplest and most economical way to obtain 
the unilltcral distributions for the initial and final letters of di- 
graphs is to make a digraphic distribution and then add the tallies in 
each row to yield the distribution for bhe initial letters, and add the 
tallies in each column to obtain the distribution for the final Hettcrs. 

18 

In this connection, it would be well for the student to familiarise 
himself with that por bion of Appendix 2 which contains digraphic fre- 
quency data. 
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digraphs, larger portions of messages may be read because the; sl'-cletons 
of words formed from the few high-frequency digraphs very definitely 
limit the values that can be inserted for the intervening unidentified 
digraphs. For example, suppose that the plaintext digraphs RE, IN, OH, 
HD, HO, SI, HT, and TO are among those that have been identified by 
frequency considerations, corroborated by a tentatively identified long 
repetition; and suppose also that the enemy is known bo he using a large 
table of 676 cells containing digraphs showing reciprocal equivalence 
between plaintext and ciphertext digraphs . Suppose the message begins 
as follows (in which the assumed values have "been inserted). 



XQ 


VO 


ZI 


LK 


AP 


OL 
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PV GK 


IK OL 


UK AT 


HN LK 






ND 


IN 




NT 
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NT 
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DY 


TY 
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GI 










SI 




ON 


TO 















The initial words SECOHD INFANTRY REGIMENT are readily recognized. 
Furthermore, if CK c =GIp, then GI c =CKp, which suggests ATTACK as the last 

word in the message beginning. This fragment of the message may now he 
completely recovered. SECOHD INFANTRY REGIMEHT NOT YET IH POSITION TO 
ATTACK 



e. Just as the choice of probable words in the solution of uni- 
literal systems Is aided or limited by the positions of" repeated letters 
(see subpar. 49 a), so, in digraphic ciphers, is the placing of cribs 
aided or limited by the positions of repeated digraphs . In this con- 
nection, several frequent words and phrases containing repeated digraphs 
have been tabulated for the student's aid, and this list of digraphid 
idiomorphs is presented as Section D m Appendix 3 (q.v.). Thus, if one 
is confronted by a ciphertext message containing the following repeated 
sequence (therefore likely to represent an entire word) . 

- "* "VI • „t\t «•' " j- **, - <■ • 

VI FW HM AZ FF FW RO' 



* — 

he may refer to the appropriate section of Appendix 3 which will dis- 
close, on the basis of the idiomorphic pattern "AB — — — AB" starting 
with the second cipher digraph, that the underlying plaintext word may 
be RE EN FO RC EM EN T, among others . Once a good start has been made 
and a few words have been solved, subsequent work is quite simple and 
straightforward. A knowledge of enemy correspondence, including data 
regarding its most common words and phrases, Is of as much assistance in 
breaking down digraphic systems as it is in the solution of any other 
cryptosystems . 4 

1 1 * * 

_f . In the case of trigraphic substitution, analys is is made con- 
siderably more complex by the large amount of traffic required, not only 
for the initial entries, but also for further exploitation of the enter- 
ing wedges. In effect, the solution of a trigraphic system closely 
parallels the solution of the syllabary portion of a large two-part code, 
these techniques will be discussed in Military Cryptanalysis, Part V. 
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69. Analysis of four-square matrix systems . — a. In all the small- 
matrix methods illustrated in paragraph 66, the encipherment is only- 
partial] y digraphic because there are certain relationships between 
those plaintext digraphs which have common elements and their corre- 
sponding ciphertext digraphs, which will also have common elements. For 
example, jin the four-s quar e matrix given in Fig. 53 , it will be noted 
that AAp =FT C , AFp=FO c , ALprFMj,, AQp=FL c , and AVpcFKg. In each of these 

cases when Ap is the initial letter of the plaintext pair, the initial 

letter of the ciphertext equivalent is F c . This, of course, is the 

direct lesult of the method, it means that the encipherment is mono- 
alphabetic for the first half of each of these five plaintext pairs. 

This relationship holds true for four other groups of five pairs begin- 
ning with Ap, in effect, there are five cipher alphabets employed, not 

25 * Thus, this case differs from, the case discussed under subpar. 68a 
only in that the monoalphabeticity is complete, not for half of all the 
pairs but only among the members of certain groups of pairs . In a true 
digraphic system, such as a system making use of a 676-cell randomized 
table, relationships of the foregoing type are entirely absent, and for 
this reason such a system is cryptographically more secure than small- 
matrix systems . r - - - - 

b. From the foregoing it is clear that when solution has progressed 
sufficiently to disclose a few values, the insertion of letters within 
the cells of the matrix to give the plaintext -ciphertext relationships 
indicated by the solved values immediately leads to the disclosure of 
additional values. Thus, the solution of only a few values soon leads to 
the breakdown of the entire matrix. 

c. The following example will serve to illustrate the procedure. 

(l) Let the message be as follows. 
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(2) The cipher having "been tested for standard alphabets (by the 
method of completing the plain-component sequence) and found to give 
negative results, a uniliteral frequency distribution is made. It is as 
follows : 




ABCDEFGHlJKLMNOPQR 

H 18 20 8 10 30 17 22 24 0 8 14 ~U 18" 15 16 33 9 




STUVWXYZ 
6 11 11 0 " i 12 7 3 



(3) At first glance this may appear to the untrained eye to be a 
monoalphabetic frequency distribution, but upon closer inspection it is 
noted that, aside from the frequencies of four or five letters, the 
frequencies for the remaining letters are not very dissimilar. There 
are, in reality, no very marked crests and troughs --certainly not as many 
as would be expected in a monoalphabetic substitution cipher of equal 
length. The (j) test, if taken (this test, as a rule, is not necessary 
with samples of text of sizes such as this), would show unsatisfactory 
results (<f> o = 608 b, as against 4^=7870 and 4 r = ^5^3) • 

(4) The message is carefully examined for repetitions of 4 or more 
letters, and all of them are listed* 



Frequency Located in lines 



TFHHAYZLQCIAAIQUCBTP (20 letters) 2 H and L. 

QMLFEQQIGOI (11 letters) 2 C and F. 

XIFBEX (6 letters) 2 C and D. 

FEQQ 3 C, D, F. 

QMLF . 3 C, F, K. 

BFHM.. 2 B and G. 

B3PK 2 A and E. 

GOIH 2 D and K. 



Since there are quite a few repetitions, two of considerable length, 
since all but one of them contain an even number of letters, since these 
repetitions with but two exceptions begin on odd letters and end on even 
letters, and since the message also contains an even number of letters 
(344), the cryptogram is retranscribed into 2-letter groups for further 
study. It is as follows: 51 
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Message transcribed in parrs 





1 


a 


s 


4 


& 


s 


7 


8 


& 


10 


11 


12 


13 


14 


is 


A 


HF 


CA 


PG 


OQ 


IL 


BS 


PK 


MN 


DU 


KE 


OH 


QN' 


FB 


OR 


UN 


B 


QC 


LC 


HQ 


BQ 


BF 


HM 


AF 


XS 


10 


KO 


QY 


FN 


SX 


MC 


GY 


C 


XI 


FB 


EX 


AF 


DX 


LP 


MX 


HH 


RG 


KG 


QK 


QM 


LF 


EQ 


QI 


D 


.GO 


IH 


MU 


EO 


RD 


CL 


TU 


FE 


QQ. 


CG 


QN 


HF 


XI 


FB 


EX 


E 


>4— — 
FL 


BU 


QF 


CH 


QO 


QM 


AF 


IX 


SY 


CB 


EP 


FN 


BS 


PK 


NU 


F 


QX 


TX 


EU 


QM 


JLF_ 


EQ 


Qi 


GO 


_IE 


UE 


HP 


IA 


NY 


TF 


LB 


G 


FE 


EP 


ID 


HP 


CG 


NQ 


IH 


BF 


HM 


HF 


XC 


KU 


PD 


GQ 


PN 


H 


CB 


CQ 


LQ 


PN 


FN 


PN 


IT 


OR 


TE 


NC 


CB 


CN 


TF 


HH 


Jffl 


J 


,ZL 


QC 


IA 


AI 


QU 


CH 


-TP 


CB 


IF 


GW 


KF 


CQ 


SL 


QM 


CE 


K 


OY 


CR 


QQ 


DP 


RX 


FN 




LF 


ID 


GC 


CG 


10 


GO 


IH 


HF’ 


L 


IR 


CG 


GG 


ND 


LN 


OZ 


TF 


GE 


ER 


RP 


IF 


HO 


TF 


HH 


_AL 


M 


,ZL 


_3C_ 


IA 


AI 


_£U 


CH 


TP 
















" 



It is noted that all the repetitions listed above break up properly into 
digraphs except in one case, viz., FEQQ in lines C, D, and F. This 
latter seems rather strange, and at first thought one might suppose that 
a letter was dropped out or was added in the vicinity of the FEQQ in 
line D. But it may be assumed that the FE QQ in line D has no relation 
at all to the .F EQ Q. in lines C and F and is merely an accidental repe- 
tition. 

(5) A digraphic distribution is made as follows* 
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ABCDEFGHIJKLM50PQRSSUVWXYZ 



A 




■ 


a 








a 


■ 


9 


1 




1 






■ 


■ 


m 


■ 


a 




■ 


■ 










B 




m 




■ 


■ 




i 


■ 




■ 


■ 


■ 




m 


■ 


s 


m 


i 


i 


a 


m 


m 


I! 


■ 






C 


| 




■ 






m 


I 


9 




a 


■ 






m 




■ 


m 


■ 


■ 




a 


m 


■ 


i 






D 






m 




1 


a 


S 


■ 




■ 




■ 




m 


i 


a 


■ 


■ 


■ 


S 


a 


m 


a 




■ 


■ 


? 




m 






■ 




1 


a 


■ 










m 


a 


a 


9 


m 


a 


a 




9 


a 


0 


■ 




F 


| 


m 


i 




1 




■ 


a 






■ 


a 




m 


■ 


■ 


■ 


u 


■ 


■ 






u 


■ 


■ 


1 


D 




» 


m 




■ 


■ 


m 


■ 


» 


■ 










s 










■ 


■ 


m 


m 




m 


■ 


H 






■ 




■ 






m 


1 


■ 






m 






3 






■ 


1 




1 


■ 








I 


■ 


u 




■ 


m 


3 




t=? 




■ 








1 


a 








i 


E 














tf 








■ 


u 








# 








u 












i 


■ 


9 


m 


■ 


■ 


u 


■ 


K 










if- 


■ 


• 






1 


■ 








a 


a 






■ 


■ 


■ 




■ 


■ 


■ 


□ 


£ 


■ 


BL 


£* _ 


■ 




1 








E 


S 






m 


1 


1 


tm 




a 




■ 


a 


a 


■ 


a 


■ 


H 


■ 




« 






i 




1 




1 


■ 






m 


1 


i 




1 


E 




9 


1 


9 


9 


s 


E 


S 






a 








■ 


1 


■ 


■ 






■ 


■ 








E 


■ 












■ 


■ 


0 
















m3 




■ 


■ 


■ 


■ 


1 




■ 


■ 


a 


■ 


■ 










■ 


9 


p 




















■ 


m 




■ 


a 






■ 


■ 


















Q 






m 












a 


■ 


_ 




a 


a 






. 








ae 






■ 


M 




R 


■ 




■ 


■ 


■ 
















■ 


■ 


■ 


■ 


■ 


■ 
















« 


s 


■ 


■ 




■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 






■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


■ 


T 












3 


■ 


■ 








■ 








ve 






















U 






















































V 






















































W 








































• 














X 








































) 












> 


Y 








































■ 














Z 
























c 
















1 


* 














4 


9 


8 


5 


7 


20 


8 


10 


7 




3 


6 


8 


13 


9 


9 


« 

12 


5 


r-v- 

3 1 


9 


- 


1 


8 


7 


1 



7 
6 

3L8 

3 

9 
10 

9 

13 

16 

5 

8 

4 

5 

6 
7 

22 

4 

3 

10 
2 



4 

2 



Figure 60. 



( 6 ) The appearance of the foregoing distribution for this message 
is quite characteristic of that for a digraphic substitution cipher. 
Although there are 676 possible digraphs, only 107 are present in the 
distribution, this parallels what is expected of normal plain text, since 
out of the 676 possible two-letter combinations (including "impossible 
plaintext digraphs” such as QQ, JK, etc., which might have been used for 
special indicators, punctuation marks, etc.) only about 300 are usually 
used m the construction of plain text. 1 ? The number of blank cells. 



!9 The 300 most frequent digraphs comprise 95$ of normal English plain 
text (Appendix 2, Table 7-A). 
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569 , closely approximates the 565 vhich would he expected in a distri- 
bution made on a sample of plain text of this size, as shown "by Chart 8. 
Furthermore, although there are many cases in which a digraph appears 
only once, there are quite a few in which a digraph appears two or three 
times, four cases in which a digraph appears four times, one case in 
which a digraph appears five times, and one in which a digraph appears 
six times. All of the foregoing observations concerning the distribution 
are reflected hy the 4 test: the observed digraphic phi value, 210, 

-compares very favorably with the expected plain value (=.0069 x 172 x 
171 s 203) as against the expected random value (r.0015 r 1J2 x 171 - 
44) , Thus all indications point to a digraphic substitution system. 

(7) Since neither the <|>o 0 780 ) and A 0 (4) for the initial leiters 

of the cipher digraphs nor the <j> 0 ( 1496 ) and A 0 ( 2 ) for the final - 

letters are too satisfactory in their approximation to the values ex- 
pected for monoalphabet lc distl ibutions (4p=19^2 and 4 r =1133j A p=5 and 

A r r0), the possibility of a pseudo -digraphic system is ^jruled out. There 

remain >the possibilities of a partially -digraphlc system employing a 
small matrix, or a true digraphic system employing a large, randomized 
table. In one common type of small-matrix system, the Playfair cipher, 
'one of the telltale indications besides the_ absence ( of (usually) the 
letter J is the absence of cipher doublets, that is, two successive 
identical cipher letters. The occurrence of the double letters GG, HR, 
and QQ in the message under investigation eliminates the possibility of 
its being a normal Playfair cipher. For want of more accurate diagnostic 

pA pi \ 1 

criteria at this stage , the simplest thing to assume, from among the 
various hypotheses that remain to be considered, is that a four-square 
matrix is involved. One yith norpal alphabets (as being the simplest 
case) in Sections 1 and 2_is therefore set-down (Figure 6la) . , 



20 r# it 

Even a medical practitioner often cannot successfully diagnose a 
condition on the first visit. Cryptanalytieally speaking, we are still 
on our "first visit". Subsequent probing will, we hope, reject or sub- 
stantiate this or that hypothesis or assumption, until the patient (the 
cipher text) is recovered '(i.e., brought back to plain text). v -* 

21 

However, see the treatment on the diagnosis of various types of 
digraphic systems in subpar 73^. 
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A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 












L 


M 


N 


0 


P 












Q 


R 


S 


T 


U 












V 


W 


X 


Y 


Z 






















A 


B 


C 


D 


E 












F 


G 


H 


S3 


K 












L 


M 


N 


0 


P 












Q 


R 


S 


T 


U 












V 


W 


X 


Y 


z 



Figure 6 la. 



(8) The recurrence of the group QMLF, three times, and at intervals 
suggesting that it might be a sentence separator, leads to the assumption 
that it represents the word STOP. The letters Q, M, L, and F are there- 
fore inserted in the appropriate cells in Sections 3 and 4 of the diagram. 
Thus (Fig. 6lb) • 



A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 












L 


M 


N 


0 


P 










L 


Q 


R 


S 


T 


U 








Q 




V 


W 


X 


Y 


z 






















A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 








F 




L 


M 


N 


0 


P 






M 






Q 


R 


S 


T 


U 












V 


w 


X 


Y 


z 



Figure 6lb . 



These placements seem rather good from the standpoint that keyword-mixed 
sequences may have been used in these two sections. Moreover, in Section 
3 the number of cells between L and Q is just one less than enough to 
contain all the letters M to P, Inclusive, this suggests that one of 
these letters, probably If or 0, is in the keyword portion of the sequence. 
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that is, near the top of Section 3. Without making a commitment in the 
matter, let us suppose that M follows L and that P precedes Q; then let 
both N and 0, for the present, he inserted in the cell between M and P. 
Thus (Fig. 6 lc)* 




A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 












L 


M 


N 


0 


P 










L 


Q 


R 


S 


T 


U 


M 


N 

0 


P 


Q 




V 


W 


X 


y 


Z 






















A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 








F 




L 


M 


N 


0 


P 






M 






Q 


R 


S 


T 


U 












V 


W 


X 


y 


Z 



Figure 6lc . 



( 9 ) Now, if the placement of P in Section 3 is correct, the cipher 
equivalent of THp will he P0 C , and there should he a group of adequate 

frequency to correspond. Noting that PN_ occurs three times, it is as- 
sumed to represent THp and the letter N is inserted in the appropriate 

cell in Section k . Thus (Fig . 6 ld) • 




A 


B 


C 


D 


E 












F 


G 


H 


I-J 


K 












L 


M 


N 


0 


P 










L 


Q 


R 


S 


T 


u 


M 


8 


P 


Q 




V 


W 


X 


y 


z 






















A 


B 


C 


D 


E 








N 




F 


G 


H 


I-J 


K 








F 




L 


M 


N 


0 


P 






u 






Q 


R 


S 


T 


U 












V 


W 


X 


Y 


Z 



Figure 63d. 
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(10) It is .about tune to try out these assumed values in the message. 
The proper insertions are made, with the following results. 





1 


2 


3 


i 


5 


8 


7 


8 


0 


JO 


n 


22 


13 


u 


16 


A 


HF 


CA 


PG 


OQ 


IL 


BS 


PK 


MN 


DU 


KE 


OH 


QN 


FB 


OR 


UN 


B 


QC 


LC 


HQ 


BQ 


BF 


HM 


AF 


XS 


10 


KO 


QY 


FN 


SX 


MC 


GY 


C 


XI 


FB 


EX 


AF 


DX 


LP 


MX 


HH 


RG 


KG 


QK 


QM 


LF 


EQ 


QI, 


























ST 


OP 






D 


( GO 


IH 


MU 


EO 


RD 


CL 


TU 


FE 


QQ 


CG 


QN 


HF 


XI 


FB 


EX 


E 


FL 


BU 


QF 


CH 


QO 


QM 


AF 


TX 


SY 


CB 


EP 


FN 


BS 


PK 


NU 














ST 




















F 


QI 


TX 


EU 


M_ 


LF 


_EQ_ 


J3L. 


GO 


IE 


UE 


HP 


IA 


NY 


TF 


LB 










ST 


OP 






















G 


FE 


EP 


ID 


HP 


CG 


NQ 


IH 


BF 


HM 


HF 


XC 


KU 


PD 


GQ 


PN 
































TH 


H 


CB 


CQ 


LQ 


PN 


Fn 


PN 


IT 


OR 


TE 


NC 


CB 


CN 


TF 


HH 


AY, 










TH 




TH 




















J 


<ZL_ 


_gc_ 


IA 


AI 


_QU_ 


CH 


TP 


CB 


IF 


GW 


KF 


CQ 


SL 


QM 


CB 






























ST 




K 


OY 


CR 


QQ 


DP 


RX 


FN 


MI 


"LF 


ID 


GC 


CG 


10 


GO 


IK 


HF 
















ST 


OP 
















L 


IR 


CG 


GG 


ND 


LN 


OZ 


TF 


GE 


ER 


RP 


IF 


HO 


TF 


HH 


A Y> 


M 




_SC_ 


IA 


AI 


_QIL 


CH 


TP 



















(ll) So far no impossible combinations are in evidence. Beginning 
with group H4 in the message is seen the following sequence. 

' PIJIPH 
T H . . T H 

* t 

Assume it to be THAT THE. Then ATp=FH c , and the letter N is to be in- 
serted in row 4 column 1 of Section 4 . But this is inconsistent with 
previous assumptions, since il in Section 4 has already been tentatively 
placed in row 2 column 4. Other assumptions for FN C are made* that it 
is, IS p (THIS TH...), that it is EN p (THEN Til...), but the same incon- 
sistency is apparent. In fact the student will see that FN C must re- 
present a digraph ending in F, G, H, I-J, or K, since N c is tentatively 
located on the same line as these letters in Section 2. Now FIf c occurs 
4 times m the message. The digraph it represents must be one of the 
following • 



DF, 


DG, 


DH, 


DI, 


DJ, 


DK 


OF, 


OG, 


OH, 


01 , 


OJ 


IF, 


IG, 


IH, 


II, 


IJ, 


IK 


TK, 










JF, 


JG, 


JH, 


JI, 


JJ, 


JK 


IF, 


YG, 


YH, 


YI, 


YJ 
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Of these the only one likely to be repeated 4 times is OF, yielding 



FNFNPK 

T II 0 F T H which may be a part of 

CQLQPNFNPNIT C 0 L Q P H F M P ll T 

NOHTHOFTHE. or .SOUTHOFTHE 



In either case, the position of the F m Section. 3 
F . L in row 3 Theie are 3 cells intervening 
which G, II , I-J, and K may be inserted. It is not 
G, H, and K are m the keyword as that I should be 
assumed that this is the case, and let the letters 
in the appropriate cells in Section 3 Thus (Fig 



is excellent 
between F and L, into 
nearly so likely that 
in it Let it be 
G, H, and K be placed 
6le) 
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B 
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D 


E 
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I-J 


K 
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0 
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t 
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H 


K 


L 
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R 
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T 


U 


'm 
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0 


P 


Q 




V 


W 
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Y 
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} 

j 


1 
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H 


I-J 
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M 
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0 
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T 


U 
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W 


X 


Y 


Z 



Figure 6le. 





Let the resultant derived values be checked against the frequency dis- 
tribution. If the position of H in Section 3 Is correct, then the di- 
graph ONp, normally of high frequency, should be represented several 

times by HF C . Reference to Fig . f 6o shows HF C to have a frequency of 4 . 

And HMq, with 2 occurrences", represents flSp. There is no need to go 
through all the possible corroborations. 

P N F R P H 

(l2) Going back to the assumption that I H . . T H is part of the r 
expression - 1 ~ — , 

C QLQPNFHPNIT C QLQPNFNPHIT 
.NORTHOFTHE. or .SOUTHOFTHE., 

it is seen at once from Fig 6le that the latter is apparently correct 
and not the former, because LQ C eauals QUp and not ORp. If OSprCQc, this 

% 
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means that the letter C of the digraph CQ c must be placed in row 1 column 
3 or row 2 column 3 of Section 3» Now the digraph CB c occurs 5 times, 

CG c , 4 times , CH<. , 3 times , CQq , 2 times . Let an attempt be made to 

deduce the exact position of G in Section 3 and the positions of B, G, 
and H in Section h. Since F is already placed in Section 4, assume G 
and H directly follow it, and that B comes before it. How much before? 
Suppose a trial be made. Thus (Fig. 6lf) 
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B 


C 


D 


E 






C? 
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H 
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H 
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Y 


Z 



Figure 6lf 

* ~ - r - 

By referring now to the frequency distribution. Fig. 60, after a very 
few minutes of experimentation it becomes apparent that the following is 
correct* ... ' 
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B 
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D 


E 






C 
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H 


I-J 
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Figure 6lg. 
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(13) The identifications given by these placements are inserted 
the text, and solution is very rapidly completed The final matrix 
deciphered text are given below. 



A 


B 


C 


D 


E 


S 


0 


C 


I 


E 


F 


G 


H 


I-J 


K 
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Y 
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W 
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I-J 
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Z 



Figure 6lh. 
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I 


N 
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C 


I 
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T 


Y 
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F 


B 


C 
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I 


F 


B 


E 


X 


A 


F 


D 


X 


L 


P 


M 


X 


H 


H 


R 


G 


K 


G 


Q 


K 


Q 


M 


L 




w 


I 


L 


L 


B 


E 


I 


N 


G 


E 


N 


E 


R 


A 


L 


S 


U 


P 


P 


0 


R 


T 


s 


T 


0 


D 


G 


0 


I 


H 


M 


U 


E 


0 


R 


D 


C 


L 


T 


U 


F 


E 


Q 


Q 


C 


G 


Q 


N 


H 


F 


X 




N 


G 


A 


T 


T 


A 


C 


K 


S 


P 


E 


C 


I 


A 


L 


A 


T 


T 


E 


N 


T 


I 


0 


N 


w 


E 


F 


L 


B 


U 


Q 


F 


C 


H 


Q 


0 


Q 


M 


A 


F 


T 


X 


S 


Y 


C 


B 


E 


P 


F 


N 


B 




P 


A 


I 


D 


T 


0 


A 


S 


S 


I 


S 


T 


I 


N 


G 


A 


D 


V 


A 


N 


C 


E 
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F 


F 


F 


Q 


I 


T 


X 


E 


U 


Q 


M 


L 


F 


E 


Q Q 


I 


G 
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I 


E 


U 


E 


H 
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I 


A 


N 




R 


I 


G 


A 


D 


E 


S 
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T 
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E 
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D 


H 


P 


C 


G 


N 


Q 


I 


H 


B 


F 


H 


M 


H 


F 


X 


C 


K 


U 


P 




L 


A 


C 


E 


C 


0 


N 


C 


E 


N 


T 


R 


A 


T 


I 


0 


N 


S 


0 


N 


w 


0 


0 


D 


S 


II 


C 


B 


C 


Q 


L 


Q 


P 


N 


F 


N 


P 


N 


I 


T 


0 


R 


T 


E 


N 


C 


c 


B 


C 


N 


T 




A 


N 


D 


S 


0 


U 


T 


H 


0 


F 


T 


H 


A 


Y 


E 


R 


F 


A 


R 


M 


A 


N 


D 


H 


I 


J 


Z 
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Q 


C 


I 


A 


A 


I 


Q 


U 


C 


H 


T 


P 


C 


B 


I 


F 


G 


W 


K 


F 


C 


Q 


S 




Z 


E 


R 
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E 


I 


G 


H 


T 


D 


A 


S 


H 


A 


A 


N 


D 


0 


_N 


W 


0 


0 


D 


s 


E 


K 
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Q 


D 


P 


R 
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N 
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L 
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D 


G 
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G 
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T 
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E 
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S 
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E 


D 


0 


N 


H 


I 


M 


Z 


L 


Q 


G 


I 


A 


A 


I 


Q 


U 


C 


H 


T 


P 


























Z 


E 


R 


0 


E 


I 


G 


H 


T 


D 


A 


S 


H 


A 

























BORU 

LERY 

X M C G 
A R L 0 

F E Q Q 
PDUR 

I F B E 
I LLB 

S P K N 
I R S T 

Y T F L 
WILL 

DGQP 
N 0 R T 

F H H A 
L L S I 

L Q M C 
A S T A 

0 I H H 
G A T 0 

F H H A 
L L S I 



in 

and 



N 

F 

Y 
W 

I 

I 

x J 

E 

U 

B 

B 

P 

N 

H 

Y 
X 

B 

N 

F 

N 

Y 
X 
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d. In the solution of four-square cryptograms, advantage may "be 
taken not only of the general type of digraphic j&lomorphs mentioned in 
subpar. 68e, above, but also of a special type of partial ldiomorphism 
present in any four-square cryptograms involving the use of a matrix in 
which the plain components consist of normal alphabets normally in- 
scribed. 22 As an illustration, let the digraphs SO UT (H.) be enciphered 
by means of any four-square having normal alphabets m Sections 1 and 2, 
and it will be found that In the encipherment the initial letter of the 

cipher digraph representing S0 p will be identical to the initial letter 
of the cipher digraph representing UT p , regardless of how the cipher com- 
ponents are constructed. On this basis a blief list of specialized 
single -letter patterns have been compiled for use in" the solution of such 
a digraphic system, this list of "four-square digraphic idiomorphs" con- 
stitutes Section F of Appendix 3* 

e. It is interesting to note how much simpler the technique of 
analysis is in the case of so-called inverse four-square ciphers, which 
involve the use of a matrix wherein the ciphertext sections contain normal 
alphabets, the plain components being mixed. For example, referring to 
Fig. 53 , suppose that Sections 3 and 4 are used as the source of the 
plaintext pairs, and Sections 1 and 2 as the source of the ciphertext 
pairs, then ON p =ET c , EHp=GE c , etc. The simplicity of the analytic pro- 
cedure will be made clear by the following exposition. 

(1) To solve a message enciphered with an Inverse four-square matrix, 
it is necessary to perform two steps. First, convert the ciphertext pairs 
Into their plain-component equivalents by "deciphering" the message with a 
matrix in which all four sections contain normal alphabets, this operation 
yields two uniliteral substitution "ciphers", one composed of the odd 
letters, the other of the even letters. The second step is to solve these 
two monoalphabetic portions 

(2) As an example, let us consider the following cipher text, known 
(or assumed) to have been encrypted with a trinome-digraphic 2 ^ system 



If any other known plain components were involved, the procedure of 
deriving a list of idiomorphic patterns would be modified to fit the 
particular case. 

If the cipher text were being examined "from cryptanalytic scratch", 
the limitations (003-595) of the cipher text when the latter is divided 
into trinomes for examination would have at once indicated that this 
grouping is the one which merits detailed analysis. The <f> 2 test would 
then give an indication of the digraphic nature of the underlying crypto- 
graphic treatment. 
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incorporating a four-square matrix similar to that illustrated in Fig. 
except that the plain-component sections have "been changed 



2 0 3 2 3 
2 7 8 3 5 
2 0 8 0 5 
4 6 8 1 8 
2 4 3 0 2 
15 3 13 
4 6 5 3 6 

T 8 2 1 

O 8703 
0 0 7 4 1 



85081 
4 1 8 0 4 
7 4 13 5 
3 3 9 3 0 

83832 
6 1 0 4 1 

6 2 6 6 3 

7 3 9 3 3 
7 0 9 1 4 
3 3 3 8 1 



83450 
5 0 4 1 3 
3 5 4 7 3 
9 13 9 3 

2 8 3 5 9 
ooi44 
44007 
81193 

1 9 3 9 1 

3 3 5 9 3 



2 7 9 3*1 
27416 
32626 
4 1 1 0 4 

3 8 0 2 2 
10 10 1 
1 8 3 4 5 
4 7 9 2 4 
11607 

3 9 3 4 0 



11503 

3 3 0 9 1 
9 116 0 

4 13 3 1 

6 10 4 3 
8 2 4 0 3 
b l 4 0 2 
0 4 0 3 2 

7 13 7 1 
6 3 5 3 1 



09168 
0 10 9 2 
0 3 2 1 8 
17296 
69130 

3 6 1 6 8 
88152 

4 13 0 6 

5 3 5 9 5 
8 8 13 3 



(3) The first thing to he done is to construct a four-square matrix 
with the known ciphertext sections, and inscribe arbitrary alphabets in 
the pi „ext sections, as follows* 



A 


B 


C 


D 


E 


000 


025 


050 


075 


100 


F 


G 


H 


I 


K 


125 


150 


175 


200 


225 


L 


M 


XT 


0 


P 


250 


275 


300 


325 


350 


Q 


R 


S 


T 


u 


375 


400 


425 


450 


475 


V 


W 


X 


Y 


z 


500 


525 


550 


575 


600 




1 


2 


3 


4 


A 


B 


c 


D 


E 


5 


6 


7 


8 


9 


F 


G 


H 


I 


K 


10 


11 


12 


13 


14 


L 


M 


H 


0 


P 


15 


16 


17 


18 


19 


Q 


R 


S 


T 


U 


20 


21 


22 


23 


24 


V 




X 


Y 


Z 





(4) The cipher text is then written xn trinomes , and these trinomos 
are "deciphered” by means of the foregoing matrix, yielding the converted 
cipher text as follows: 




■ RESTRICTE D 



186 




RESTRICTED 



REF ID:A56892 



jJ 



5 10 15 



A 


203 


238 


508 


183 


450 


279 


*+ Y 

341 


■k 

150 


309 


168 


278 


"3$ 


180 


450 


413 




ID 


IP 


YF 


IH 


OP 


PB 


MP 


FB 


PH 


IR 


OB 


Ptf 


EH 


OJD 


TM 


B 


274 


163 


309 


101 


“092" 


‘208 


057 


ll3 _ 


535 


473 


326 


269 


11 6 


003 


218 




PV 


XM 


PH 


BE 


CT 


II 


CH 


TM 


VM 


TY 


MD 

rK 


PQ 


BU 


DA 


IT 


C 


468 


183 


393 


091 


393 


4ii 


044 


133 


117 


296 


1 

2^3 


028 


383 


228 


359 






IH 


TQ 


BT_ 


TQ 


_RM 


ER 


IF 


cu 




IU 


Ub 


TF 


IE 


FK 


D 


380 


226 


104 


369 


130 


153 


136 


io4 


100 


144 


101 


018 


240 


336 


168 




OF 


GE 


EE 


FU 


FF 


IB 


GL 


EE 


AE 


KQ 


BE 


DQ 


FU 


MQ 


IR 

m 


E 


465 


366 


266 


344 


007 


183 


450 


l4o 


288 


152 


478 


217 


393 


381 


193 




OT 


MU 


MQ 


PT 


CF 


IH 


QD 


FQ 


OM 


HB 


TE 


HT_ 


TQ 


_RF 


IS 


F 


479 


240 


403 


24l 


306 


087 


037 


091 


419 


391 


13 6 


077 


137 


153 


595 




UE 


FU 


TB 


GU 


MH 


CO 


CM 


BT 


UR 


RQ 


BU 


CD 


HL 


IB 


VY 


G 


007 


413 


338 


133 


593 


393 


4o6 


353 


188 


133 














CF 


TM 


00 


IF 


YT_ 


TQ 


_RG 


OE 


m 


IF 













The distributions "of the letters constituting the initial letters and 
final letters of the converted digraphs afe as follows 



2-f sr 

-_s 5s s gf 

M = ~a=— 5 = e= 5=5 jz= _ __ „ 

(Initial Letters) ABCBEFGHIKLMNOPQRSTUVWXYZ 



(Final Letters) 



s _is s s M £g 



ABCDEFGHIKLMNOPQRSTUVU 



X Y Z 



(5) p .Using straightforward principles of frequency and partial idio- 
morphs, the plain text (beginning with the opening words EHEMY 
KECOMAISSAHCE . . . ) is recovered, and the following equivalents are obtain- 
ed for the converted cipher letters of the two alphabets 

1 f-a. 

(Initial Letters) C ABCDEFGHIKLMHOPQRSTUVWXYZ ^ 

P BRAHMSCDEF I LHOP TUV Y 

M Jt-t . _ 



(Final Letters) 



C ABCDEFGHIKLMNOPQRSTUVUXYZ 
P* W A NERBCDFHIKLMOPQSTUV Y 



^ Rote the ABA pattern of the first word in the message (ENEMY), made 
patent by the two -alphabet conversion process. Also note the 3-fold 
repetition (representing the plaintext word STOP) which, although hidden 
In the original cipher text, now comes to light. 
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Keyword-mixed sequences directly manifest themselves because the original 
enciphering matrix contained such sequences in Sections 1 and 2 , inscribed 
in the same manner as were the arbitrary A-Z sequences which were used for 
the conversion. In fact, the key words of the two distributions might 
have been recovered from an analysis of the "profiles" of the distributions 
above, as described in subpar 

( 6 ) The original enciphering matrix is then reconstructed, thus 



B 


R 


A 


H 


M 


000 


025 


050 


075 


100 


S 


C 


D 


E 


F 


125 


150 


175 


200 


225 


0 


I 


K 


L 


N 


250 


275 


300 


325 


350 


0 


P 


Q 


T 


U 


375 


4oo 


425 


450 


475 


V 


w 


X 


Y 


Z 


500 


525 


550 


575 


600 


i 


1 


2 


3 


4 


w 


A 


G 


N 


E 


5 


6 


7 


8 


9 


R 


B 


C 


D 


F 


10 


11 


12 


13 


14 


H 


I 


K 


L 


M 


15 


16 


17 


18 


19 


0 


P 


Q 


S 


T 


20 


21 


22 


23 


2k 


U 


V 


X 


Y 


Z 



(7) Although the example illustrated was that of a numerical di- 
graphic system, it is obvious that this technique of solution also applies 
to literal four-square systems in which the cipher components are known 
sequences. It should be clear to the student the tremendous difference it 
makes when it is possible to convert a digraphic system into a two-alphabet 
system, in a digraphic system, we are plagued by a potential 67(1 different 
elements in the cipher, whereas in a two-alphabet system we still have 
only 26 elements (in each of two sets, it is true) in the cipher text to 

be solved This principle of conversion of cipher text Into a secondary 
cipher text has application in some of the most complex types of crypto- 
systems, the student would do well to keep this in mind 

( 8 ) As a further observation on inverse four-square systems, it is 
pointed out that where the same mixed alphabet is present in Sections 3 
and 4, the problem Is still easier, since the letters resulting from the 
conversion into plain-component equivalents all belong to the same, single 
mixed alphabet, thus such a digraphic system is reduced to an ordinary 
simple substitution cipher. 

f . The solution of cryptograms enciphered by other types of small 
matrices is accomplished along lines very similar to those set forth m 
subparagraph c on the solution of a four-square cipher, this will be 
illustrated in subsequent paragraphs There are, unfortunately, few means 
or tests which can be applied to determine in the early stages of the 
analysis exactly what type of digraphic system is involved in the first 
case under study The author freely admits that the solution outlined in 
subparagraph c is quite artificial m that nothing is demonstrated in step 
( 7 ) that obviously leads to or warrants the assumption that a four-square 
matrix is involved. The point was passed over with the quite bald state- 
ment that this was "from among the various hypotheses that remain to be 
considered" — and then the solution proceeded exactly as though this_mere 
hypothesis had been definitely established For e/ample, the very first 
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results obtained vere based upon assuming that a certain 4-letter repe- 
tition represented the word STOP and immediately inserting certgin letters in 
appropriate cells in a four-square matrix with normal sequences in Sec - 
tions 1 and g . Several more assumptions were built on top of 'that, and 
very rapid strides were made What if it had not been a four-square 
matrix at all? What if it had been some other type of not icadily identi- 
fiable digraphic system? The only defense that can be made of what may 
seem to the student to be purely arbitrary procedure based upon the 
author's advance information or knowledge is the following. In the first 
place, in order to avoid making the explanation a too-long-drawn-out 
affair, it is necessary (and pedagogical experience warrants) tnat certain 
alternative hypotheses be passed over in silence. In the second place , 
it may now be added, after the principles and procedure have been eluci- 
dated (which at this stage is the primary object of this text) that if 
good results do not follow from a first hypothesis, the only thing the 
cryptanalyst can do is to reject that hypothesis and formulate a second 
hypothesis In actual practice tie may have to reject a second, third, 
fourth, ...nth hypothesis. In the end he may strike the right one — or he 
may not. There is no assurance of success in the matter In the third 
place, one of the objects of this text is to show how certain crypto- 
systems, if employed for military purposes, can readily be broken down 
Assuming that some type of digraphic system is in use, and that daily 
changes in key words are made, it is possible that the traffic of the 
first day might give considerable difficulty in solution if the specific 
type of digraphic system were not known to the cryptanalyst But by the 
time two or three days' traffic had accumulated it would be easy to solve, 
because probably by that time the cryptanalytic personnel would have suc- 
cessfully analyzed the cryptosystem and thus learned what type of matrix 
or table the enemy is using 



70 Analysis of two-square matrix systems — a Cryptosystems in- 
volving either vertical two-square or horizontal two-square matrices may 
be Identified as such and solved by capitalizing on the cryptographic pe- 
culiarities and idlosyncracies of these systems It will be noted that, 
considering the mechanics of the cryptosystems, in vertical two -square 
matrices employing the normal enciphering conventions, 2 ^ exactly 20$ of 
the 625 "possible" plaintext digraphs will be "transparent" (i.e , scif- 
enciphered) in cipher text, in horizontal two-square systems, exactly 20$ 
of the 625 digraphs will be characterized by an "inverse transparency" 



25 

That is, for vertical two-square systems, digraphs are self -enciphered 
1 2 

if 6^ and ©„ fall in tne same column in the matrix, and, for horizontal 

* ^ *1 P 

two-square systems, if 0^ and 0 are in the same row, the ciphertext &i- 

Xr Jr 

graphs are the reversed plaintext digraphs 
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(i.e., enciphered by the same digraphs reversed).^ Therefore, if an 
examination of a cryptogram or a set of cryptograms discloses a goodly 
portion of what appear to be direct transparencies (cipher digraphs which 
could well be plaintext digraphs) it may then be assumed that a vertical 
two-square matrix has been used for the encryption. On the other hand, 
if a large number of cipher digraphs could be "good" plaintext digraphs 
if the positions of the letters were reversed , then it may be assumed that 
the cryptosystem involved a horizontal two-square matrix Sometimes 
skeletons of words or even of whole phrases are self-evident in such 
cipher text, thus affording an easy entering wedge into the cryptosystem 

b An example will best serve to illustrate the techniques of 
identification and subsequent solution of a two -square matrix cipher. 

The following naval message Is to be studied 



U 0 D L C 


ENO 


A N 


S 


I 


G L B 


B E I 


R I 


R 


C R 


G 


L 


N 


M 0 L C 


P T E P G 


R B B 


0 E 


G 


P 


ABQ 


W N N 


K S 


I 


P C 


R 


M 


0 


0 R A P 


D E A d 


A N X 


R A 


I 


E 


DAI 


R M A 


G B 


E 


K H 


S 


L 


C 


D D L C 


TQORE 


N D T 


M D 


T 


I 


A Q, F 


I E Q 


T A 


N 


N B 


F 


N 


0 


U 0 0 S 


SNUBS 


K T A 


S E 


S 


N 


H L P 


ONE 


K S 


I 


P C 


R 


C 


E 


NOIS 


ILIRK 


P L 0 


N 0 


N 


Z 


U C T 


ALT 


0 I 


I 


H 0 


C 


N 


0 


CERA 


0 S D I N 


0 E E 


K R 


L 


C 


U B R 


AOS 


D I 


I 


P D 


A 


H 


C 


0 G G R 


OLNOC 


W D I 


L P 


0 


I 


L N Q 


X D I 


G L 


R 


B B 


Q 


Y 


F 


S S R A 


V Y 0 I G 


R S L 


X X 

























Preliminary steps in analysis are made according to the procedures already 
described in this text, and the hypothesis of monographic, uniliteral en- 
cipherment (with either standard or mixed cipher alphabets) has been re- 
jected Multiliteral substitution, or digraphic substitution, comes next 



Although 625 "possible" plaintext digraphs are involved, the identity 
of digraphs actually used in plain text limit this figure considerably 
Furthermore, the frequenc iea of the plaintext digraphs actually used come 
into consideration, in conjunction with the location of the letters of 
these digraphs in any particular two-square matrix Thus, from the crypt- 
analyst's standpoint, there are "excellent" two -square matrices giving a 
high self -encipherment rate for high frequency plaintext digraphs, and 
there are "poor" two-square matrices which have a potentially high self- 
encipherment rate only foz those low frequency plaintext digraphs which 
may not occur at all in a given cryptogram 



RESTRICTE D 



190 




REF ID : A56892 



RESTRICTED 

into consideration The cipher text 



5 



uo 


DL 


CE 


NO 


AN 


SI 


GL 


PT 


ER 


GR 


BB 


OE 


GP 


AB 


DE 


AM 


HA 


?NX, 


RA 


IE 


DA 


TCI 


OR 


EN 


DT 


MD 


TI 


AQ 


SN 


NN 


RK 


TA 


SE 


SN 


HL 


HL 


IR 


KP 


LO 


NO 


NZ 


UC 




PI 


NO 


EE 


KR 


LC 


UB 


OL 


NO 


CW 


DI 


LP 


01 


LN 


VY 


01 


GR 


SL 


XX 







is written in digraphs, as follows 

10 15 



BB 


El 


RI 


RC 


RG 


LN 


MO 


LC 


QW 


m 


KS 


IP 


CR 


MO 


OR 


AP 


IR 


MA 


GB 


EK 


HS 


LC 


DD 


LC 


FI 


EQ 


TA 


NN 


BF 


NO 


UO 


OS 


PO 


NN 


KS 


IP 


CR 


CE 


NO 


IS 


TA 


LT 


01 


IH 


oc 


NO 


CE 


RA 


RA 


OS 


DI 


IP 


DA 


RC 


OG 


GR 


QX 


DI 


GL 


RB 


BQ 


YF 


SS 


RA 



62 



Hoting the 8-letter repetition 90 letters apart, the 6-letter repetition 
16 letters apart, and the 4-letter repetition at an interval of 220 let- 
ters, and that those repetitions begin on odd letters and end on even 
letters, credence is given to the grouping of the cipher text into pairs 
of letters. A digraphic distribution is then made, illustrated in Fig. 

63 
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c 
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E 


F 
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H 


1 


j 
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if 


0 


p 
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R 


s 


T 


U 


V 
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Y 
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9 
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B 
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9 


a 




B 


B 


F 




























9 








9 


B 




3 


9 


fi 


■ 


E 


E 


0 












4 










■ 


■ 




E 


■ 


■ 


■ 


5 


E 


9 


■ 


9 


■ 


9 


9 


B 


H 






■ 


■ 






■ 


■ 






■ 


9 
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Figure 63 



c The $>*?, 152, is most satisfactory when compared with (107) 

and ^ ( 23 ) Since the cryptogram has all the earmarks of a digraphic 

cipher, and no manifestations sire found to support the hypothesis of a 
multiliteial system, the next problem is the specific determination of the 
particulai kind of digraphic* system involved It may be noted tnat there 
are quite a few digraphs in the cipher text which resemole good plaintext 
digiaphs, proportionally more so than, for instance, m the cryptogram m 
subpar 69 c, the cryptologic finger points to the possibility of e two- 
square system However, since the words "good digraphs" are serranuicalJy 
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elus ive, lei us attempt to determine statistically wnethei or not a two- 
Square System might he involved &M, if a two-sqUafe, whether it is more 
probably a Vertical or a hori^ohtaJ two -square 

d First i fof the pUi*po§§ 0f determining whether '^direct trans- 
ports ie s'* or ’'inverse trahspaf eh.2 ie s predominate in this cryptogram.} the 
digraphs df the distribution in Fig 63 will he set down in tabular form, 
With an Indication of their frequency ih the cryptogram., and with data 
relative to the probability of these ‘digraphs as' plaintext digraphs , and 
a§ plaintext digraphs fallen FeVer s§dT In the table on page 19 b, col (l) 
is a listing of the ciphertext digraphs j 661 ( 2 ) is the frequency of the 

ciphertext digraph as it occUf§"in the cryptogram, cdl ( 3 ) is the 3oga~ 
rithfil Of the theoretical plaihtext frequency of the particular digraph 
(from fable 15} Appendix 2)’ T~Col (h) represents the products of the 
entries in cols (2) end (3)j col; (3) is the logarithm of "the theoretical 
plaihtext frequency of the reversed digraph (frola fable 15 > Appendix 2), 
axid col. (6) represents the products df the Shtries itt cols. (2) and (*>). 
Froifi this, the sum of the valued in coi. (4), 5Q*3^> is taken to be the 
,( direct tranSparehdV' 1 Value ) &hd the slM 5f the Values in 261; (6), 63 . 02, 
is taken to be the "inVerSe traftspdfSfiSy 1 ' value. fhUs, since this par- 
ticular cryptogram has' An 11 inverSe transparency 1 ' Value which is higher 



07 — •" 

fhe test to be described in the following subparagraphs is based on 
aH evaluation 8 f those instance § faherein the bbSerVCd frequency of any 
particular ciphertext digraph approximates the freqUChdy with whicn the 
particular digraph} or its reVSFsal, voUld be 'expeStbd to ocCur if con- 
sidered as a plaintext digraph Any s\ich corfSlatidh Which occurs m a 
fouf-Bqllard or Playfair cipher, or ill I SPyptOgfain produced by a large 
randomized digraphic table, is puheiy accidental becaUsd it is not a 
resUlt of the mechanics Of the system KoweVer , in two-sqUare cf yptogretfh.% 
shch correlation is c&used by_ the MSchaniSS 6 f the system in the encipher- 
ment of 20 % of the possible plaintext digraphs) alid these causal instances 
of correlation Scour in addition to §ny accidental instahcds fahich may 
arise In the encipherfnent of the remaining 8056 Thus, if a digraphie 
cipher Exhibits mOrdly the random feXpdStatioh 6 f c of rhlat lofi both when the 
particiildf ciphertext digiaphs are considered as they afe 'and when their 
reVeFSals are' considered, the cryptogram Shy be assumed to involve a 
system Sther than tvo-sqUaf§ If A digidphic cipher exhibits more than 
the random expectation of corf'e latiofi, hither faheh the parVicUlar digraphs 
§£§ considered direct or when coHfoid'ered £§Vdfs3&, it may he assWffid to 
irlVoiVe tw8~sqtiafe SEcipherment , and the paht icUD ar __ cbSsidex-ation — that of 
the digraphs direct of that of the digraphs reversed — whach gives £lse to 
the greater degree of correlation indicated whether the cryptogram involves 
a Vertical two-§<piaFe 8r §. hofiz6fital two-sqUare , ’respectively 
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(1) 


(2) 


(3) 


(4) 


(6) 


(6) 


(1) 


<2) 


(3) 


(4) 


(S) 


(8) 


(1) 






(4) 


(0) 


(6) 


AB 


1 


45 


0,46 


.38 


0. 38 


HA 


1 


.67 


0,67 


,25 


0.25 


OR 


2 


,80 


1,78 


.74 


1,48 


AM 


1 


,01 


0,01 


,78 


0,78 


HL 


2 


.13 


0,26 


,13 


0,26 


OS 


3 


.61 


1,83 


.62 


1.80 


AN 


1 


.89 


0,89 


.72 


0.72 


HS 


1 


,38 


0,38 


.72 


0.7? 


FO 


1 


.64 


0.64 


,72 


,0,72 


AP 


1 


,58 


0. 58 


.61 


0.01 


335 


1 


.59 


0.50 


.73 


0.73 


rr 


1 


• 51 


0.61 


.26 


0.26 


AQ 


1 


.00 


0.00 


.00 


0.0Q 


III 


1 


,00 


0,00 


.77 


0.77 


QW 


1 


.00 


0.00 


.00 


0.00 


BB 


2 


.00 


0,00 


,00 


0,00 


IP 


3 


.48 


1.44 


.45 


1,35 


ox 


1 


,00 


0. 00 


,00 


0,00 


BF 


1 


,00 


0.00 


,00 


0,00 


IR 


2 


.73 


1.46 


.75 


1,50 


RA 


4 


o 

OO 


3.20 


,82 


3.28 


BQ 


1 




0.00 


• 

O 

O 


0,00 


IS 


1 


.78 


0.78 


.77 


0,77 


KS 


1 


,25 


0,26 


.35 


0,25 


CE 


3 


16 


2,28 


,78 


2.28 


KF 


1 


.00 


0.00 


.00 


0,00 


RC 


2 


,63 


1. 06 


.38 


0,70 


CR 


2 


co l 

CO 

• 


0,76 


.53 


1. 06 


KR 


1 


• 

o 

o 


0.00 


.13 


0,13 


RG 


1 


.48 


0.48 


.42 


0.42 


CW 


1 


.13 


0.13 


,00 


0.00 


KS 


2 


.13 


0,26 


.13 


0,26 


RX_ 


1 


.75 


0.76 


,73 


0.73 


DA 


2 


.76 


1 52 


,73 


1.46 


LC 


4 


.33 


1.32 


.42 


1.08 


RK 


- 1 


.13 


0,13 


.op 


0.00 


DD 


1 


.51 


0.51 


.51 


0.51 


IN 


2 


.13 


0,26 


.42 


0. 84 


SE 


1 


,84 


0. 84 


.80 


0.86 


DE 


1 


.77 


0.77 


.88 


0.88 


10 


1 


.59 


0.59 


,67 


0.67 


SI 


1 


,77 


0.77 


.78 


0,78 


DI 


4 


.73 


2 92 


45 


1.80 


LP 


1 


.33 


0.33 


.59 


0.59 


SL 


1 


,25 


0,25 


.45 


0.45 


DL 


1 


CO 

CO 

* 


0. 33 


.63 


0.63 


nr 


1 


.51 


0,51 


.42 


0,42 


SN 


2 


.38 


0,76 


.71 


1.42 


DT 


1 


.62 


0. 62 


.45 


0.45 


MA 


1 


.78 


0.78 


.61 


0.61 


ss 


1 


.67 


0,67 


,67 


0,67 


EE 


1 


a> 

OO 


0. 81 


,81 


0.81 


MD 


l 


.13 


0.13 


,42 


0,42 


TA 


3 


.74 


2.22 


.83 


2.49 


El 


1 


.73 


0.73 


.59 


0.59 


MO 


2 


55 


1.10 


.72 


1.44 


TI 


1 


* 

CC 

CO 


0. 82 


73 


0 IS 


EK 


1 


.00 


0,00 


,45 


0.45 


M 


4 


.51 


2,04 


.51 


2.04 


TQ, 


1 


.13 


0.13 


.00 


0.00 


EN 


1 


.99 


0.99 


.87 


0.87 


NO 


7 


.66 


4.62 


.92 


5,74 


UB 


1 


.33 


0,33 


.25 


0,25 


EQ 


1 


.58 


0,58 


.00 


0.00 


NX 


1 


• 

O 

O 


0.00 


.13 


0,13 


UC 


1 


,33 


0.33 


OO 

CO 

• 


0 38 


ER 


1 


.94 


0.94 


.96 


0. 96 


NZ 


1 


.00 


0. 00 


.00 


0,00 


UO 


2 


.13 


0,26 


.79 


1.58 


FI 


1 


• 

OO 

o 


0.8Q 


.55 


0.65 


OC 


1 


.51 


0,51 


O 

CO 

• 


0,80 


VY 


1 


O 

O 

* 


0,00 


,00 


0.00 


GB 


1 


.00 


0,00 


.00 


0,00 


OE 


1 


33 


0. 33 


.58 


0.58 


XX 


1 


• 

o 

o 


0.00 


o 

o 

• 


0.00 


GL 


2 


.25 


0,50 


.13 


0.26 


OG 


1 


.25 


0.25 


.45 


0,45 


YF 


1 


,56 


0. 56 


.13 


0.13 


GP 


1 


.25 


0.25 


.00 


0.00 


01 


3 


,42 


1.26 


.80 


2. 40 




125 




58,34 




63. 02 


GR 


3 


.42 


1.26 


_48 


1.44 


OL 


1 


67 


0 67 


.59 


0.59 















(1) Identity of cipher digraph appearing in the cryptogram. 

(2) Frequency of the particular digraph as it occurs in the cryptogram 

(3) Logarithm of theoretical plaintext frequency of the particular digraph (from Table 15, Appendix 2), 

(4) Product of entries in columns (2) and <3). 

(5) Logarithm of theoretical plaintext frequency of the digraph's reversal (from Table 15, Appendix 2). 

(6) Product of entries In columns (2) and (5), 
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than the "direct transparency" value, it may be assumed*^ to involve a 
horizontal two-square-~if , indeed, two-square encipherment has been em- 
ployed It is now for us to establish whether or not this latter is the 
case, and this will be done by determining whether or not the foregoing 
observed value, 6 3 02, is representative of the degree of tiansparency 
which may be expected in a horizontal two 1 - square cipher, (if the "direct 
transparency" value had been the higher of the two, then it would have 
been more probable that a vertical two-square were involved, and it would 
be necessary to determine whether or not this observed value was repre- 
sentative of the degree of transparency expected in a vertical two-square 
cipher) 

e The observed "Inverse transparency" value (selected in this case 
because it is the higher observed value) will be compared with the value 
expected from a horizontal two-square cryptogram of the same size, and if 
this observed value is as great as or greater than the transparency value 
expected for horizontal two-squares, the cryptogram may be considered to 
be a horizontal two-square cipher, if the observed value is lower than 
the expected two-square value, decision will have to be suspended The 
transparency value expected in a horizontal two-square cipher containing 
N digraphs is computed by multiplying N by .3388? which in this case 



"~2S 

Actually, if the two-square hypothesis is made, the difference be- 
tween the horizontal two-square value and the vertical two-square value 
will indicate the degree of probability of the higher score over the 
lower In this case, the difference of 4 68 (= 63 02 - 58 34), which 
represents a difference of log scores , is equivalent to an overwhelming 

ratio of 100 billion to 1 (i.e , 224 ^ to l) in favor of the hypobhesis 
of a horizontal two-square The foregoing computation involves an aspect 
of mathematics which will he given detailed treatment in Military Crypt- 
analysis, Part III 

29 _ 

For the benefit of the student with a background in statistics, it 

is pointed out that by abiding by the stipulation "as great or greater", 
some cryptograms which actually are the result of two-square encipherment 
may be rejected by this stipulation, hut it will insure that only a rela- 
tively few non-two- square cryptograms will be accepted A better approach 
of a statistical nature would involve, first, computing bhe expected 
value for non- two -squares as “well as that for two-squares Then, any 
observed value falling below the expected two-square value could he 
expressed in terms of the number of standard deviations (i e , bhe 
sigmage) from this expected two -square value and, from the expected non - 
two-square value. Finally, the particular expected value which would he 
considered as significant would be the one from which the observed value 
differed by the smaller number of standard deviations The concept of 
standard deviation will be treated in Military Cryptanalysis, Part XII. 
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yields 42 35 (= 3388 x 125). 3° The observed value for the cryptogram, 

63 02, is much higher than the expected value, 42 35* Thus, it has been 
proven statistically that the cryptogram at hand Involves two-square 
encipherment, particularly, horizontal two-square encipherment 

f Having now proved that the cryptogram at hand is a horizontal 
two-square cipher, the next step is to assume some plain text in the 
message, guided by probable Inverse transparencies ( inverse because the 
system has been identified as a horizontal two-square) in the cipher text. 
Referring to the work sheet in Fig 62, the repeated sequence at B9 and 
E9 is- assumed to represent the plain text TA SK F0 RC (E~), on the basis 
of KS c =SKp, and CR c =RCp The plaintext -ciphertext values are now 



30 

In the case of vertical two-squares, N would be multiplied by the 
constant 3^10 The mathematical considerations underlying this test and 
their r^iofs (involving Bayes* theorem and Bayes* factors) are beyond the 
scope this text, however, for the benefit of the mathematician, the 
derivation of the foregoing constants :s explained below, along with the 
derivation of the constant used for computing the expected transparency 
value for non -two-squares In the formulas, below, 

£ 

= the summation over all digraphs AA-ZZ 

F^g = the frequency of a given digraph AB as found m Table 6A, 
Appendix 2 

**££ = the logarithm (to the base 224) of the frequency of a given 
digraph AB as found in Table 15, Appendix 2 



For vertical two-squares, 

k: §.* m [• 8 °(- 0015) + -w e ]= 3610 

For horizontal two-squares, 

k = £. << r 80( 0015) + — — 1 = 3388 

AB BA L 5° 00 J 



For non -two 
k 



square digraphic 

= = 2737 

676 



sys terns , 
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31 

recorded in a skeleton reconstructibn diagram as illustrated in Fig. 
64a. At A3, the assumption of (-R) EC ON NA IS SA NC (E-) is tossed off 
without much ado, since four of the six digraphs concerned are trans- 
parent. The plain-cipher relationships from this assumption are added 
to the reconstruction diagram, as shown in Fig. 64h. Continuing in this 
vein, the plain text (-A) IR CR AF (T-) is inserted at A10, and the plain 
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Figure 64a. 
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Figure 64b. 



text (-B) AT TL ES HI (P-) is inserted at J3, the successive cumulative 
reconstruction diagrams for these two assumptions are shown in Figs « 64c 
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Figure 64c . 



During the reconstruction of the squares of the matrix, the student 
should keep clear in his skeleton diagram which letters are in the same 
row, and which are in the same column. It will he found expeditious to 
draw a dividing line (either horizontal or vertical, depending on the 
type of two -square matrix involved) on the page to keep the elements of 
the two squares independent, recording the values which are in the same 
row or column and writing down the letters as they are assumed. In the 
early stages of this process the student must exercise care in recording 
the letters so that no falBe relationships are formed, in other words, 
the values should he written down so that they are not in the same row or 
column with any letters other than those with which they are known to he 
related. This will entail spreading the work rather widely over the page 
initially, then gradually telescoping and reducing the size of the re- 
construction diagram as the work progresses, until in the end it will he 
reduced to a concise matrix of two 5x5 squares. 
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and 64d "below. It is to "be noted that at J7 > 0C„ =P8p; "but since in 

Fig. 64d it has already been determined that 0C c =9Sp, then 0C c must equal 

PS aking the word BATTLESHIPS rather than in the singular. 

* 

g. At this point the partially filled-in work sheet will look as 
follows : 



5 



A 


UO 


DL 


CE 


NO 


AN 


SI 


GL 






-R 


EC 


ON 


NA 


IS 


SA 


B 


FT 


ER 


GR 


BB 


OE 


GP 


AB 






RE 


-E 


NC 


EO 


-fe 


NE 


C 


DE 


AM 


HA 


NX 


RA 


IE 


DA 












AR 


-0 




D 


TQ 


OR 


EN 


DT 


MD 


TI 


AQ 






RO 


BA 






IT 




E 


SN 


NN 


RK 


TA 


SE 


SN 


HL 




NS 


TA 




AT 


TO 


NS 




F 


HL 


IR 


KP 


LO 


NO 


NZ 


UC 






-0 




OL 


ON 




-B 


G 


, OS 


DI 


NO 


EE 


KR 


LC 


UB 




<’ 




ON 


EE 








H 


OL 


NO 


CW 


DI 


LF 


01 


LN 




-S 


ON 








ES 


T- 


J 


VY 


01 


GR 


SL 


XX 










ES 


-E 


LS 









10 15 



BB 


El' 


RI 


RC 


RG 


LN 


MO 


LC 


NC 


EA 


IR 


GR 


AF 


T- 


E- 




QW 


NN 


KS 


IP 


CR 


MO 


OR 


AP 




TA 


SK 


FO 


RC 


E- 


RO 


-E 


TR 


MA 


GB 


EK 


HS 


LC 


DD 


LC 


"0 




-E 












FI 


EQ 


TA 


NN 


BF 


NO 


UO 


OS 


-R 




AT 


TA 




ON 






PO 


NN 


KS 


IP 


CR 


CE 


NO 


IS 




TA 


SK 


FO 


RC 


EC 


ON 




TA 


LT 


01 


IH 


OC 


NO 


CE 


RA 


AT 


TL 


ES 

t 


HI 


PS 


ON 


EC 


AR 

J 


RA 


OS 


DI 


IP 


DA 


RC 


OG 


GR 


AR 






FO 




CR 




-E 


ox 


DI 


GL 


KB 


BQ 


YF 


SS 


RA 






SA 


N- 






L~ 


AR 
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Skeletons of additional plain text, such as the word OUR at Al, PRESENCE 
OF ENEMY at Bl, PROBABLE at Dl, ATTACK ON OUR INSTALLATIONS at BIO, 
CARRIER at Fl4, and VESSELS at Jl, may now clearly be seen The complete 
recovery of the plain te-ct follows, and the reconstruction diagram is 
completed and telescoped into the form shown in Fig 64e Since phe- 
nomena of keyword-mixed sequences are observed, the row's and columns of 



Figure 64e " * Figure 64f 

Fig 64e are permuted to yield the original two -square matrix as shown 
in Fig 64f — ~ 

I > * V- 

h The solution of vertical two-square systems follows analogous 
lines, with the necessary modifications of the reconstruction diagram in 
consonance with the difference in mechanics between horizontal and verti- 
cal two -square systems 

i A few additional remarks concerning the test applied m subpars 
d and e, above, are in order First, the exceptionally high transparency 
value observed in this cryptogram is a direct result of the very favor- 
able manner in which the keyword-mixed sequences in the two squaibs inter- 
act, m the foregoing cryptogram, 47 of the 325 digraphs present (approx 
38 $) were inverse transparencies It is also pointed out that, although 
some actual two -square cryptograms may be rejected by that portion of the 
test which was described in subpar e, the other phase of the test (de- 
scribed in subpar d) — by which one may determine whether a cryptogram is 
more probably a vertical two-square encipherment or more probably a hori- 
zontal two-square encipherment — is extremely sensitive and highly accu- 
rate The foregoing statistical method is not merely valuable per se as 
an application of cryptomathematics m the analysis of two-square matrix; 
systems, but is included as being illustiative of the general principles 
of special techniques that may be developed in the attack on any particu- 
lar cryptosystem, the mechanics of which are known to the cryptanalyst 
The field of actual operational cryptanalysis is replete with special 
methods of attack of this nature ~ 

— I - 

71 Analysis of Playfair cipher systems --a ~'0f all digraphic 
cryptosystems employing small matrices, the one which has been most 
frequently encountered is the Playfair cipher Certain variations of 
this cipher have been incorporated m several complex manual ciphers 
used in actual operational practice, because of this it is important that 
the student gain familiarity with the methods of solution of the classic 
Playfair system 
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b The first published solutions-^ for this cipher are quite 
similar basically and vary only in minor details The earliest, that by 
Lieut. Mauborgne (later to become Chief Signal Officer of the U.S. Array), 
used straightforward principles of frequency bo establish the values of 
three or four of the most frequent digraphs Then, on the assumption 
that in most cases in which a keywoid appears on the first and second rows 
the last five letters of the normal alphabet, VWXYZ, will rarely be dis- 
turbed in sequence and will occupy the last row of the square, he "jug- 
gles" the letters given by the values tentatively established from 
frequency considerations, placing them in various positions in the square, 
together with VWXYZ, to correspond bo the plaintext-ciphertext rela-cion- 
ships tentatively established A later solution by Lieut Frank Moorman, 
as described in Hitt's manual, assumes that in a Playfair cipher prepared 
by means of a square in which the key word occupies the first and second 
rows, if a digraphic frequency distribution is made, it will be found 
that the letters having the greatest combining power are very probably 
lette )f the key A still later solution, by Lieut Commander Smith, 
is peihaps the most lucid and systematized of the three He sets forth 
in definite language certain considerations which the other two writers 
certainly entertained but failed to indicate 

c The following details have been summarized from Smith's so- 
lution 

(1) The Playfair cipher may be recognized by virtue of the fact that 
it always contains an even number of letters, and that when divided into 
groups of two letters each, no group contains a repetition of the same 
letter, as UN or EE Repetitions of digraphs, trigraphs, and. polygraphs 
will be evident m fairly long messages 

(2) Using the square^ shown in Fig 65 , there arc two general cases 
to be considered, as regards the results of encipherment 
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Figure 65 



32 

Mauborgne, Lieut J. 0., U S.A An advanced problem in cryptography 
and its solution, Leavenworth, 1914 , 

Ilitt, Captain Parker, U.S.A Manual for the solution of military 
ciphers , Leavenworth, 1918 

Smith, Lieut Commander W. W., U.S.K. In Cryptography by Andre 
Langie, translated by J.C.H. Macbeth, New York, 1922 

33 

The Playfair square accompanying Smith's solution is based upon the 
key word BANKRUPTCY, "to be distributed between the first and fourth lines 
of the square" This is a simple departure from the original Playfair 
scheme in which the letters of the key word aie written from left to right 
and m consecutive lines from the top downward 
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of a rectangle T?ho following 



Reciprocity and reversibility ^ 

Case 2 Two letters in 
lustrative relationships are 

But RKp does not = AN C , 

Reversibility only 

(3) The foregoing gives 

Rule I (a) Regardless of the position of the letters in the 
square , if 



the seme row or column. The following il- 
found 

nor does KNp=RA c . 



rise to the following 



Case 1 Letters at opposite comers 
3 llustrative relationships are found 



TH^YF,, 



YFf,sTH n 
FXl=IIT rt 



1 2l3 4, then 

2 114 3 

This rule is of particular aid in selecting probable words in the 
solution of Playfair ciphers, as will be shown shortly 35 

(b) If 1 and 2 form opposite corners of a rectangle, the 
following equations obtain 

„ , - 1 2=3 4 

2 1=4 3 

3 4=1 2 

4 3=2 1 



oil 

D By way of explaining what is meant by reciprocity and by reversi - 
bility , in the case of digraphic systems, the following examples are 
given THp=IF e and YFp-THc constitute a reciprocal relationship, 

THptYF c and HTp3?Y c constitute a reversible relationship 
oc 

J In this connection, a list of frequently-encountered words and 
phrases which contain reversed digiaphs (so-called "ABBA patterns") has 
been compiled and is included as Section E, "Digraphic ldJomorphs. 
Playfair", in Appendix 3 
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(U) A letter considered as occupying a position in a row can be com- 
bined with but four other letters in the same row, the same letter con- 
sidered as occupying a position m a column can be combined with but four 
other letters in the same column. Thus, this letter can be combined with 
only 8 other letters all told, under Case 2, above But the same letter 
considered as occupying a corner of a rectangle can be combined with 1 6 
other letters, under Case 1, above Smith derives from these facts the 
conclusion that "it would appear that Case 1 is twice as probable as 
Case 2" . He continues thus (notation my own) • 



"Now in the square, note that 



ANpsNK, 

GNp=FI£ 



OI C s KKq 

CBpSTKp 

XMp=>/K c 



also 



EIL=FA„ 

EjM1=FL„ 

EWl=FV c 

EFJ=FG C 



"Fiom this it is seen that of the 2k equations that can be formed 
when each letter of the square is employed either as the initial or final 
letter of the group, five will indicate a repetition of a corresponding 
letter of plain text 



"Hence, Rule II After it has been determined, in the equation 

1 2=3 4, that, say, ENp=FA c , there is a probability of one in five that 

any other group beginning with F indicates E©_,, and that any group 
ending in A c indicates ©Np 3° 

"After such combinations as ERp, ORp, and ENp have been assumed or 

determined, the above rule may be of use m discovering additional di- 
graphs and partial words' 1 



1 > S 

3 5 

There is an error in this reasoning Take, foi example, the 24 equations having F as an initial letter 



Cose 

1 


FB„=DNp 


Case 

2 


TE=ED 


Case 

2 


FT=NM 


Case 

I 


FX=GW 


2 


FD =EH 


1 


FL=EM 


2 


FW=NT 


1 


FR=HN 


1 


FI =DM 


1 


FP=ET 


1 


nf=GN 


2 


FH=EG 


1 


FU =DT 


1 


FV=EW 


2 


FG=Er 


1 


FQ=HM 


1 


FS =DW 


2 


FN=NV7 


1 


F0=GM 


1 


FY=HT 


1 


FA =EN 


2 


ra=NF 


1 


FC=GT 


1 


FZ=HW 



Here, the initial litter F, represents the following initial letters of plain-text digraphs 

DOp, EG, NOp, GOp, and H9 D 

It is seen that F c represents D OI N 0 G p , H, 4 times each, and E p , 8 times Consequently, supposing that it has 
been determined that FA„=ENp, the probability that F„ will represent E B is not 1 in 5 but 8 m 24, or 1 in S, but 
supposing that it has been determined that FW 0 =NT yl the probability that F„ w ill represent is 4 in 21 or 1 in 6 
The difference In these probabilities is occasioned by the fact that the first instance, FA 0 =EN„ corresponds to a 
Case 1 encipherment, the second instance rW 0 =NTp, to a Case 2 encipherment But there is no waj of knowing 
Initially, and without other data, whether one is dealing with a Case 1 or Case 2 encipherment Oulj as an 
approximation then fore, maj one sax that the piobabihtv of F„ repre enting a given 9 P is 1 in 5 A probability 
of 1 m 5 ib of almost trivial importance in this sduation -mce it lepresents -uch a long shot ' foi success Tilt 
following rule might be preferable If the equation 12 = 3 1 has been established where all the letteis represented 
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Rule III In the equatiofi 1 2r3<4, 1 and 3 can never be identical, 
nor can 2 and 4 ever be identical Thus, Aifp could not possibly be 
represented by AY C , nor could ERp be represented by KR C This rule is 
useful in elimination of certain possibilities when a specific message 
is being studied 

Rule IV In the equation 1.2p:3«4 c , ^ 2 and 3 are identical, the 
letters are al3 m the same row or column, and in the relative order 

1- 2-4 from left to right or top to bottom, respectively In the square 

shown, AM =NK C and the absolute order is AilK The relative order 1-2-4 
includes five absolute orders which are cyclic permutations of one 
another Thus' AM. NK. .A, K. .AW, ..AM, and .AM 

Rule V In the equation 1 2p=3»4 c , if 1 and 4 are identical, the 
letters are all m the same row or column, and in the relative order 

2- 4-3 from left to right or top to bottom In the square shown, KM dRK c 

and the absolute order is MR The relative order 2-4-3 Includes five 
absolute orders which are cyclic permutations of one another Thus 
MR.., KR .E, R..M, ..MR, and .NKR. . 

Rule VI "Analyze the message for group recurrences. Select the 
groups of greatest recurrence and assume them to he high-frequency di- 
graphs. 37 Substitute the assumed digraphs throughout the message, test- 
ing the assumptions in their relation to other groups of the cipher The 
reconstruction of the square proceeds simultaneously with the solution of 
the message and aids in hastening the translation of the cipher". 



by 1, 2, 3, and 4 are different, then there is a probability of 4/5 that a Case 1 encipherment is invoked Conse- 
quently, if at the same time another equation, 3 6 = 5 2, has been established, where 2 and 3 represent the same 
letters as in the first equation, and 5 and 6 are different letters, also different from 2 and 3 there is a probability 
of 16/25 that the equation 1 6 = 5 4 is valid or if at the same time that the equation 12 = 34 has been detei mined, 
the equation 1 6=5 4 has also been established, then there is a probability of 16/25 that the equation 3 6 = 5 2 is 

12 8 4 3 8 5 2 10 54 

valid (Check this by noting the following equations based upon Tig 25a CE = PG, PH = YE, CH=YG Note 
the positions occupied in Fig 25a by the letters nHolved) Likewise, if the equations 1 2=3 4 and 1 6 = 3 5 
have been simultaneously established, then there is a probability that the equation 2 5 = 4 6 is valid, or if the 
equations 1 2 = 3 4 and 2 5 = 4 6 haxe been simultaneously established, then there is a probability that the 

12 34 13 35 36 46 

equation 2 5 = 4 6 is valid (Check this by noting the following equations CE = PG, CA = PK, EK = GA, note the 
positions occupied in Fig 25a by the letters involved ) Howeiei, it must be added that these probabilities are 
based upon assumptions which fail to take into account any considerations whatever as to frequency of letters 
or specificity of composition of the matrix For instance, suppose the 5 high-frequency letters E, T, R, I N all 
happen to fall in the same row oj column m the matrix, the number of Case 2 enCipheiments would be much 
greater than expectancy and the probability that the equation. 12 = 3 4 represents a Case 1 encipherment falls 
much below 4/5 

37 

A more accurate guide to the determination of the plaintext equiva- 
lents of high-frequency cipher digraphs would involve the consideration 
of the difference in frequency of a particular digrapn and its reversal 
Thus, an example of a high-frequency 60p which is also high-frequency in 

its reversal, is REp, an example of a high-frequency 88p which is rarely 
found in its reversed form, is THp 



ftEOTRICTED 



203 




REF ID:A56892 



• RESTRICTED 

d (l) When solutions for the Playfair cipher system -were first 
developed, based upon the fact that the letters were inserted m the 
cells in keyword -mixed order, cryptographers thought it desirable to 
place stumbling blocks in the path of such solution by departing from 
strict, keyword -mixed order One of the simplest methods is illustrated 
in Fig 65, "wherein it will be noted that the last five letters of the 
keyword proper are inserted in the fourth row of the square instead of 
the second, where they would naturally fall. Another method involves 
inserting the letters within the cells from left to right and top down- 
ward but using a sequence that is derived from a columnar transposition 
instead of a keyword -mixed sequence Thus, using the keyword BANKRUPTCY . 

215 4 79683 10 

BANKRUPTCY 

DEFGHILMOQ, 

S V W X z 

Sequence AEVBDSCOKGXNFWPLRHZTMUIYQ 
The Playfair square is as follows 



A E V B D 
S C 0 K 0 
XNFWP 
L R II Z T 
M U I Y Q 



Figure 66a 

( 2 ) Note the following three squares 



0 K G S C 
FWPXN 
H Z T L R 
IYQHU 
V B D A E 



Figure 66b Figure 66c 

At first glance they all appear to be different, but closer examination 
shows them to be cyclic permutations of one another and of the square m 
Fig. 66a. They yeld identical cryptographic equivalents m all cases 
However, if an attempt be made to reconstruct the original key word, it 
would be much easier to do so from Fig 66a than from any of the others, 
because in Fig 6 b a rne original keyword-mixed sequence has not been 
disturbed as much as in Figs 66b, c, and d In woikmg with Playfair 
ciphers, the student should be On The lookout for such instances of 
cyclic permutation of the original Playfair square, for during the course 
of solution he will not know whether he is building up the original or an 
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equivalent cyclic permutation of the original matrix, Usually only after 
he has completely reconstructed the matrix will he he able to determine 
this point 

e (l) The steps in the solution of a typical example of this 
cipher will now he illustrated Let the message he as follows 
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( 2 ) Without going through the preliminary tests in detail, with 
which it will he assumed that the student is now familiar , 3 the con- 
clusion is reached that the cryptogram is digraphic m nature, and a 
digraphic frequency distribution is made (Fig 67) 
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Figure 67 



Since there are no double-letter groups (termed "doublets"), the 
conclusion is reached that a Playfair cipher is involved and the message 
is rewritten in digraphs 
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(3) 


The following three fairly 
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» 






- 




— 




ft 







The first long repetition, with the sequent reversed digraphs CX and CC 
Immediately suggests the word BATTALION X see Section E, Appendix 3) .> 
split up into -B AT TA LI ON and the sequence containing this repetition 
in lines F and G becomes as follow , , r < 

Line F OX OT UZ FA CX XC FZ XH CY NO TY 

- B AT TA LI ON 

Line G YA TE XH FA CX XC FZ XH YC TX \TL 

B AT TA LI ON 

(4) Because of the frequent use of numerals before the word BAT- 
TALION (as mentioned m Section B of Appendix 4) and because of the ap- 
pearance of ONp before this word m line G, the possibility suggests 
itself that the word before BATTALION in line G is either ONE or SECOND. 
The identical cipher digraph FA in both cases gives a hint that the word 
BATTALION in line F may also be preceded by a numeral, if ONE is correct 
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in line G, then THREE is possible in Dine F On the other hand, if 
SECOND is correct ixfline G, then THU® is possible in line F Thus 



Line F 


. OX 


OT 


uz 


FA 


CX 


XC 


PZ 


XH 


CY 


NO 


TY 


1st hypothesis 


MM 


TH 


RE 


EB 


AT 


ta' 


LI 


ON 








2nd hypothesis 




TH 


IR 


DB 


AT 


TA 


LI 


Oil 








Line G 


YA 


TE 


XH 


FA 


CX 


XC 


PZ 


XH 


YC 


TX 


1JL 


1st hypothesis 


— 


— 


ON 


EB 


AT 


TA 


LI 


ON 








2nd hypothesis 


. -S 


EC 


ON 


DB 


AT 


TA 


LI 


ON 









First, '■e that if either hypothesis is true, then OT c rTHp The 
frequenwy distribution shows that OT occurs 6 tiroes and is m fact the 
most frequent digraph in the message Moreover, by Rule I of subpara- 
graph b, if OT c =THp then TO c =HTp Since HTp is a very rare digraph in 

normal plain text, T0 C should either not occur at all in so short a 
message or else it should be very infrequent The frequency distribution 
shows that it does not occur Hence, there is nothing inconsistent with 
the supposition that the word in front of BATTALION in line F is THREE or 
THIRD, and there is some evidence that it is actually one or the other 

-f V - ' 

(?) But can evidence he found for the support of one hypothesis 
against the other? Let the frequency distrioutlon be examined with a 
view to throwing light upon this point If the first hypothesis is true, 
then UZ c =REp, and, by Rule I, ZU c =ERp The frequency distribution shows 

hut one occurrence of UZ C and but two occurrences of ZU C These do not 

look very good for RE and ER On the other 'bond, if the second hypothesis 
is true, then UZ c =IRp and by Rule I, ZU c =RIp The frequencies are much 

moie favorable in this case Is there' anything 3 neons latent with the 
assumption, on the basis of the second hypothesis, that TE c =ECp? The 
frequency distribution shows no inconsistency, for TE n oc curs once and 
ET c (=CEp, by Rule I) occurs once As regards whether FA c =EBp or rlftp, ~ 

both hypotheses are tenable , possibly the second hypothesis is ** ^nade 

better than the first, on the following reasoning By Rule 1^ if FA c =E3p 
then AF c =BEp, or if FA c =DB p then AF c iBDp The fact that no AF C occurs, 

whereas at least one BE_ may he expected in this message, inclines one to 
the second hypothesis, since BDp is very rare 

(6) Let the 2nd hypothesis be assumed to be correct The additional 
values arc tentativelv inserted in the text, and in lines G and K two 
inteiestmg repetitions aie noted 1 

Line G Hi Sli XC FT OT CX OT TC YA TD XH FA CA YC PZ XII 

TA 1 i i T TH -S EC OH DB AT TA LI OH 

Lire K MO ICB XC FT OT C f OT IH FY D T TG XI TC OL XU LT 

TA US 7 T 55 
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This cej to inly looks like STATE THAT THE . . , vhich would make IE -PT C 

Furthermore, in lino G the sequence STATETHATTHE , SI COI'tDBAl^ALIOW c.*n 
hardly be anything eloeJ,ha£_STATE THATJTI^TR SECOND BATTALION, which 
would mcke 'IC c =EI p and YA c =RS p Also &M C =-S p 

(T) It is perhaps high time that the whole list of tentative equiva- 
lent values be studied in relation to their consistency witn the positions 
of letters in the Playfair square, moreover, by so doing, additional 
values may be obtained in the process The complete list of values is 
as follows 



Assumed values 


Derived by Rule I 


S>=CXc 

Vk 


^p = ^c 

Itp-ZPc 


ONpZXHc 


NOpZlKc 


TH p ::pT c 


HTp=TO c 


IRplUZf. 






BDp=AF c 


FC p =TE c 


CEp=ET c 


TE p rPT c 


ETp=TPc 


EIp“TC c 


I&pzCTc 


5§p=YAc 


SRpZAYc 


-Sp=SM c 


S-pzMS c 



(8) By Rule V, the equation THp=OT c means that H, 0, and T are all 
in the same row or column and in the absolute order HTO similarly, C, E, 
and T are in the same row or column and in the absolute order CET 
Further, E, P, and T are in the same row and column, and their absolute 
order is ETP That is, these sequences must occur someplace m the square, 
in either rows or columns, taking into consideration of course the proba- 
bility of cyclic displacements of these sequences within the square 

_r - 1 rf # % 

» j x 

(a) H T 0 ^ -_(h) CET w (c) E T P 

i 

(9) Noting the common letters E and T in the second and third se- 
quences, these two sequences may he combined into orie sequence of four 
letters, viz , C E T P Since only one position remains to he filled in 
this row (or column) of the square, and noting m the list of equivalents 
that EI_=TC c , it is obvious that the letter I belongs to the CETP 
sequence, the complete sequence is therefore C E T P I 

(10) Since the sequence HTO has a common letter (T) with the sequence 
CETPI, it follows that if the HTO sequence occupies a roj, then the 
CETPI sequence must occupy a column, or, if the HTO sequence occupies a 
column, then the CETPI sequence must occupy a row, and they may he com- 
bined by means of their common letter, T, viz 



C E 



H 

T P 
0 



1} 



or 



. 



C 

E 



H T 0» 



I-J 
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The pi oof of whether the CETPI sequence, fox example, properly belongs as 
a row or a column of the Playfair square lies m the establishment of a 
rectangular relationship, instead of the linear relationships constructed 
thus far. 



(ll) We note that, from the assumptions in subpar d(6), Afp=CX c and 

ONpnXHc The relationship ONp=XH c might be either a rectangular one, such 
as 0 . yX, or it might be linear, viz , HTOXN or H Since however 

T 
0 
X 
N 



0 .X, or it might be linear, viz 

iX* 

H N 



AT p =CX c must be a rectangular relationship, then only the configuration 



« C A ; 


1 

ic E 


H 




} E • 


T P 


l| 


jH T 0 X Nj will be valid. 


since the alternative form \ 


0 




! P 1 


!a 


X 




LJL J 


1 


N 





will not 



satisfy the equation AT p =CX c 

(12) The fragmentary Playfair square ^ 9 has been established, in one 
of its 25 possible cyclic permutations, as 



C A 
E 

HI0XJ1 

P 

I 



Scanning the list of plain-cipher equivalents given m subpsr d(7) m 
order to insert possible additional letters, none is iound But seeing 
that several high-frequency letters have already been inserted m tbc 
matrix, perhaps reference to the ciyptogram itself m connection with 
values derived from these inserted letters may yield further clues For 
example, the vowels A, E, I, and 0 are all m .position, as are the very 
frequent consonants N and T The following combinations may be siudied*" 



Mp-ex c 

EWp=6T c 

INp=0T c 

OIIpzXHc 



'ATp=GX c 

mm* 

ITp=CP c 

'OT p =XO c 



HAplXe c TApiXCe 

T?p=P? c 
TIo-?Cc 

; OprHXc TOp-OXc 

IT ^ — 



In actual practice, it je more usual to start witr a much laxgei 
diagram than a simple 5*5 squcure, as relationships develop, tne diagram 
is giadually condensed, until finally a 3 y 5 square ec_rges This pio- 
ccdurc is quite similar to tnat employed in the re cons ti uction diagrams 
for tv, o- square matrices 
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AT p ( =CX c ) , TAp( =XC c ) , ONp( =XH C ), TE p ( =PT C ) and £Tp( =TP^) have already 

been inserted in the text Of the others, only OX c (~TOp) occurs two 
times, and this value can be at once inserted in the text But can the 

equivalents of M, EN, or IN be found from frequency considerations? 

Take ENp, for example, it is represented by ©T c What combination of 6T 
is mosu likely to represent ENp among the following cahdidates 

KT C (4 times), by Rule I, NEp would = TKc (no occurrences) 

VT C (5 times), by Rule I, NEp would s TV C (2 times) 

ZT C (3 times), by Rule I, NEp would : TZ C (l time) 

VT g certainly looks good it begins the message, suggesting the word 
ENEMY, and the sequence PZTV C , in line H, would become the plaintext 
sequence LINE. Let this he assumed to he correct, and let the word 
ENEMY also be assumed to be correct Then EMpZQE c and the partial squaie 
then becomes as shown herewith* 



P 

I 

C A 
V M E Q 
;N H T 0 X 



Figure 68a 

(13) In line E is seen the following sequence 

Line E . VT RK MW CF ZU BH TV YA BG IP RZ KP CQ FN LV 

EN RI NE RS PT -E 

The plaintext sequence RI, .NERS. .PT. . . suggests PRISONERS CAPTURED, 
as follows 

MW CF ZU BH TV YA BG IP RZ KP 
P RI SO NE RS CA PT UR ED 

This gives the following new values* 0PpSCF c , SOp=BH c , CAp=BG c , URp=RZ c , 
and EDp=KP c The letters B and G can he placed in position in the 

partial square at once, since the positions of C and A are already known 
The insertion of the letter B immediately permits the placement of the 
letter S, from the equation SOp=BH<j Of the remaining equations only 
EDp=KP c can be used Since E and P are fixed and are in the same column, 
D and K must be in the same column, and moreover the K must be in the 
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same row as E There is only one possible position for K, viz , immedi- 
ately after Q This Automatically fixes the po ext ion of D The square 

is now as shown herewith. 



P D 
I 

G S C B A 
V M E Q K 
N H T 0 X 



Figure 68b 

(l4) A review of all equations, including the very first ones es- 
tablished, gives the following which may now be used HBpfFA CJ RSptYAg 
The first permits the immediate placement of F, the second, by elimi- 
nation of possible positions, permits the placement of both R and Y 
The sq j is now as shown herewith 



P F D 
Y I R 
G S C B A 
V M E Q K 
N H T 0 X 



Figure 68£ 

Once more a review is made of all remainin g un used equations LIp=PZ c 
now permits the placement of L and Z IRp=UZ c now permits the placement 

of U, which is confirmed by the equation TJRp=RZ c from the word CAPTURED. 
There is then only one cell vacant, and it must be occupied by the only 
letter left unplaced, viz , W Thus the whole square has been recon- 
structed, and the message can now be deciphered. 

L(W)P F D 
Z Y I U R 
G S C B A 
VMEQK 
H H T 0 X 



Figure 68d 

f Reconstruction of the square m Playfair ciphers is normally 
carried on concurrently with the synthesis of the plain text, once a few 
correct assumptions have been made. Now, having just leconstructed the 
square as shown in Fig 68d, the question to be answered is whether this 
square is identical with the original enciphering matux or whether it 
is a cyclic permutation of the original squaie (which my have contained, 
say, a tianspos it ion -mixed sequence) Even though the cryptogram in 
subpar 71 ® has been solved, this point is still of interest 

I 
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(l) Tho square that is derived may not necessarily be the original 
enciphering square, more than likely it will be one of the 2k possible 



cyclic permutations of the original square If the Playfair square con- 
sisted of a key ^-rd -mixed sequence, a permutation of the square will 
cause no difficulty in recovering the original matrix and hence the key 
word For example, if the square derived in some other instance is 
QTLKO then the square P Y R A M is easily recovered because of the 



X Z U V w 
AMPYB 
B C I D S 
II K E F G 



IDSBC 
E F G H K 
L N 0 Q T 
U V W X Z 



tell-tale letters UVtJXZ occurring in a row of the derivative square But 
when the Playfan square consists of a transposition-mixed sequence, then 
a different procedure must be adopted 



(2) As an example, let us take the transposition matrix 
5861 A 327 from which A F T D K is the original square 
PYRAMIDS WIHVM 

BCEFGHKL GUPBH 

I'D'ITUVWX ZREQS 

l L X Y C 0 



Using the 



methods illustrated in par. 51g, scanning successive rows of the square 
will disclose sequences of letters which could have appeared as columns 



in the transposition matrix 



For example, discovery of the columns I 

H 

V 



will afford rapid recovery of the key word. But if instead of the original 
square we had one of its permutations such as Q S Z R E, then treatment 

C 0 L X Y 
D K A F T 
V M W I H 
B N G U P 



of the "columns", e.g , F 


V 


0 


L 


T 


M 


L 


X 


V 


W 


X 


Y 



of the tentative transposition matiix 



(assuming that some or all of the letters V, U, X, Y, Z are in the last 
row of the transposition matrix) will be without significance, therefore 
the procedure above is inapplicable without a slight modification. 

(3) Since it will be noted that a permutation of the rows will not 
affect the procedure of keyword recovery, then we construct a 9x5 rec- 
tangle QSZREQSZF which contains the five Bquares which result 
COLXYCOLX 
DKAFTDKAF 
VMWIHVMWI 
BNGUPBUGU 



simply from successive permutations of the columns. A 5x5 cut-out square 
will be found convenient in testing each permutation in turn Affirma- 
tive results will be obtained when the correct permutation is reached. 



RESTRICTED 



213 




RESTRICTED 



REF ID:A56892 



which in this case is the third square in the rectangle , namely, 
ZREQS. After recovery of the key word from this permuted “square it 
L X Y C 0 
A F T D K 
WIHVM 
G U P B N 



is probable then that the original enciphering square must have been 

A F T D K. 

WIHVM 
GUPBN 
Z R E Q S 
L X Y C 0 

( 4 ) In the case of the square recovered in Fig 68d, it is found 
that, following the procedure outlined in subpars (l), (2) , and (3) 
above he key word is based on COMPANY, recoverable from the following 
diagram* 



2536147 

COMPANY 
B D E F G H I 
KLQRSTU 
V W X z 

The original square must have been this 



A 


G 


S 


C 


B 


K 


V 


M 


E 


Q 


X 


N 


H 


T 


0 


D 


L 


W 


P 


F 


R 


Z 


Y 


I 


U 



Figure 60 £ 

g Continued practice m the solution of Playfair ciphers will make 
the student quite expert in the matter and will enable him to solve 
shorter and shorter messages +0 Also, with practice it will become a 
matter of indifference to him as to whether the letters are inserted in 
the square with any sort of regulai lty, such as simple keyword -mixed 
order, transposition-mixed ordei, or in a purely random order 

h It may perhaps seem to the student that the foregoing steps are 
somewhat too artif icia] , a bit boo "cut and dried" in their accuracy to 
portray the process of analysis as it is applied in practice For 
example, the critical student may well object to some of the assumptions 
and the reasoning in subpar £(5) , above, m which the words THREE and 



,0 Tiie author once had a Student who "specialized" m Playfair ciphers 
and became so adept that he could solve messages containing as few as 
50-60 letters within 30 minutes 
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ONE (lot hypothetic) weic lejrctcd in favor of the words THIRD and SECOND 
(2nd hypothesis) This looted 3 ai gely upon the rejection of RD p end ERp 
as the equivalents of UZ C and ZU c , and life adoption of IRp and Rjp as 
the a equivalents Indeed if the student vil3 c> amine the finoD message 
vitli a critical eye, he will find that while the bit of leasoning m step 

(5) is perfectly logical, the assumption upon which it is based is in 

fact viong, for it happens that in this case ERp occurs only once and REp 
does not occur at all Consequently, a3 though most of the reasoning 
which 3ed to the 1 ejection of the first hypothesis and the adoption of 
the second vas logical, it was in fact based upon erroneous assumption 
In other words, despite the iact that the assumption was incorrect, a 
correct deduction was made The student should take note that in crypt - 
analysis situations oi this sort ate not at a31 unusual Indeed they are 
to he expected, and a few words of explanation at this point may be use- 
ful 

jL Ciyptanalysis is a science in whJch deduction, based upon ob- 
servational. data, plays a veiy large role But it is also true that m 
this science most of the deductions usually rest upon assumptions It 
is most often the case that the cryptanalyst is forced to make his as- 
sumptions based upon a quite limited amount of text. It cannot be ex- 
pected that assumptions based upon statistical generalizations will 
always hold true vhen applied to data comparatively very much smaller in 
quantity than the total data used to derive the generalized rules 
Consequently, as regards assumptions made in specific messages, most of 
the time they will be correct, but occasionally they will be incorrect l ' 1 
In cryptanalysis it is often found that among the correct deductions 
there will he cases m which subsequently discovered facts do not bear 
out the assumptions on which the deduction was bssed Indeed, it is 
sometimes true that if the facts had been known before the deduction was 
made, this knowledge would have prevented making the correct deduction 
For example, suppose the cryptanalyst had somehow or other divined that 
the message under consideration contained no RE, only one ER, one IR, and 
two RI's (as is actually the case) He would certainly not have been 
able to choose between the words THREE and 01® (1st hypothesis) as 
against THIRD and SECOND (2d hypothesis) JBut beca use he assumes that 
there should be more ERp's and REp's than IRp’s and Rip's in the message, 

he deduces that UZ C cannot be REp, rejects the first hypothesis and takes 
the second It later turns out, after the problem has been solved, that 
the deduction vas correct, although the assumption on which it was based 
(expectation of more frequent appearance of RE p and ERp) was, in fact, 
not true in this particular case The cryptanalyst can only hope that 
the number of times when his deductions are correct, even though based 
upon assumptions which later turn out to be erroneous, will abundantly 
exceed the number of times vhen his deductions are wrong, even though 
based upon assumptions which later prove to be correct If he is lucky, 



Ul 



See footnote 1 j on page 52 
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the making of an assumption which is really not true will make no dif- 
ference in the end and will not delay solution, hut if he is specially 
favored with luck, it may actually help him solve the message — as was the 
case m this particular example 

Another comment of a general nature may be made in connection 
with this specific example. The student may ask what would have been the 
procedure in this case if the message had not contained such a tell-tale 
repetition as the word BATTALION, which formed the point of departure 
for the solution, or, as it is often said, permitted an "entering wedge" 
to be driven into the message The answer to his query is that if the 
word BATTALION had not been repeated, there would probably have been some 
other repetition which would have permitted the same sort of attack ^ If 
the student is looking for cut and dried, straightforward, unvarying 
methods of attack, he should remember that crypt analys is, while con- 
sidered a branch of mathematics by some, is not a science which has many 
"general solutions" such as aie found and expected in mathematics proper 
It is inherent in the very nature of cryptanalyfcics that, as a rule, 
only general principles can be established, their practical appli- 
cation must take advantage of peculiarities and particular situations 
which are noted in specific messages This is especially true in a text 
on the subject The illustration of a general principle requires a 
specific example, and the latter must of necessity manifest character- 
istics which make it difierent from any other example The word BAT- 
TALION was not purposely repeated in this example in order to make the 
demonstration of solution easy, "it just happened that way" In another 
example, some other entering wedge would have been found The student 
can he expected to learn only the general principles which will enable 
him to take advantage of the specific characteristics manifested in 
specific cases Here it is desared to illustrate the general principles 
of solving Playfair ciphers and to point out the fact that entering 
wedges must and can be found. The specific nature of the entering wedge 
varies with specific examples 

72 Analysis of polygraphic systems involving large tables — a The 
analysis of systems incorporating large digraphic tables is accomplished 
by entering, within the appropriate cells of a 26 x 26 chart, data corres- 
ponding to the plain-cipher lelationships of assumed cribs on 26 x 26 
chaits, and examining the charts for evidences of symmetry or systematic 
construction in their compilation The initial plaintext entries may, 
in the absence of cribs, he made on the basis of digraphic frequency 
considerations, aided by ldiomorphisms and repetitions 

b. In pseudo-digraphic systems, such as those incorporating tables 
similar to Figs 47a and b, and 48, the identification of the monoalpha- 
betically-enciphered component of cipher digraphs will greatly accelerate 
plaintext entries, »ince advantage may be taken of this monoalphabet 1 - 
city. Tables with a feature of reciprocity, such as the example m 
Fig 50 , may be exploited on the basis of this veakness, even if the re- 
ciprocal pairs are assigned at random Tables such as that in Fig 49 
and the one for trinome digraph} c encipherment shown m Fig 51 may also 
he exploited with facility, once enough plain text has been correctly 
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assumed and inserted to disclose their systematic construction A word 
of warning is inserted here against making incautious assumptions con- 
cerning the exact internal composition of tables suen as that in Fig 
since their unusual construction could easily mislead the analyst who 
Jumps to premature conclusions. In the case of a tabie such as Fig 53 
wherein the trinomes have been inscribed in straight horizontals (or for 
that matter, any other known inscription), if the dimensions of the table 
have been correctly assumed the simplest solution involves a reduction to 
two alphabets, reflecting the sequences of letters for the side and top 
of the matrix, this solution closely parallels that of the numerical 
four-square system described in subpar 6 $e 

c Because the foregoing principles are rather straightforward, it 
is not considered necessary to illustrate their application with examples 
Of course, when d] graphic tables of random construction have been used, 
no refinements in solution are possible However, the recording of as 
few as 225 different plaintext digraphs and their ciphertext equivalents 
will theoretically enable the automatic decryption of approximately 92$ 
of the cipher digraphs of messages^ and the recording of 335 plaintext- 
ciphertext va 3 ucs will enable the automatic decryption of 98$ of the 
cipher digraphs, thus almost every message may be read m its entirety 
without recourse to further assumptions Actually, it should be pointed 
out that having only 122 matched plaintext -ciphertext equivalencies will 
theoi etically enable the decryption of 75$ of the cipher digraphs, and 
enough skeletons of plain text may then be manifest to permit the decrypt- 
ion of the complete message texts 

d It might be well to point out in connection with large di- 
graphic tables that there exist literal types which give rise to mono- 
alphabetic distributions for both the initial letters and final letters 
of pairs Such a table is illustrated in Fig 69 below 
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A 


B 


C 


D 


E 


F 


G 


H 


I 


J 


K 


L 


H 


N 


0 


P 


Q 


R 


s 


T 


U 


V 


W 


X 


Y 


Z 


A 


HQ 


YQ 


DQ 


RQ 


AQ 


UQ 


LQ 


IQ 


CQ 


BQ 


EQ 


rQ 


OQ 


JQ 


KQ 


MQ 


HQ 


OQ 


PQ 


QQ 


SQ 


TQ 


VQ 


WQ 


XQ 


ZQ 


B 


HU 


TO 


DU 


HU 


AD 


UU 


LU 


IU 


CD 


BU 


EU 


FU 


GU 


JD 


HU 


HU 


HU 


OU 


PU 


QU 


SU 


TU 


VU 


WU 


XU 


23J 


C 


HE 


YE 


DE 


RE 


AE 


UE 


LE 


IE 


CE 


BE 


EE 


FE 


GE 


JE 


KE 


HE 


HE 


OE 


PE 


QE 


SE 


TE 


VE 


WE 


XE 


2E 


D 


HS 


TO 


DS 


RS 


AS 


US 


IS 


IS 


CS 


BS 


ES 


FS 


GS 


JS 


KS 


MS 


IB 


OS 


PS 


OS 


ss 


TS 


VS 


ws 


XS 


zs 


E 


HT 


YT 


DT 


HT 


AT 


UT 


IT 


IT 


CT 


BT 


ST 


FT 


GT 


JT 


KT 


ME 


HT 


or 


PT 


or 


ST 


TT 


VT 


WT 


XT 


ZT 


F 


HI 


YI 


DI 


RI 


AI 


UI 


LI 


II 


Cl 


BI 


El 


FI 


GI 


JI 


KI 


MI 


HI 


01 


PI 


QI 


SI 


TI 


VI 


WI 


XI 


ZI 


G 


HD 


TO 


DO 


RO 


AO 


DO 


ID 


10 


CO 


BO 


EO 


PO 


GO 


JD 


XD 


MO 


no 


00 


PO 


QQ 


SO 


TO 


VO 


WO 


XO 


ZO 


H 


HR 


YN 


DN 


RN 


AH 


UN 


IN 


IN 


CH 


BN 


EN 


FN 


GN 


JN 


KN 


MN 


HH 
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Figure 69. 



In effect, encipherment by means of such a system yields the equivalent 
of a two-alphabet cipher, with a transposition within each of the pairB 
of letters. The cipher text produced by such a system may be character- 
ized by a large number of repetitions which begin with the initial letter 
of digraphs and end on the final letter of digraphs and which are pre- 
ceded by digraphs having repeated initial letters or which are followed 
by digraphs having repeated f anal letters; for example, ciphertext 
passages of the following type might often arise: SF BD GB HX and 

SQ BD GB WK (wherein the repeated plain text is actually represented by 
SDHBGK, affected by the transposition) . This system is included here as 
being illustrative of many simple systems which are capable of leading 
the student very much astray in this Instance, if one were unaware of 
the transposition feature involved and were to attempt what appears to 
be the single task of fitting plain text into the two monoalphabetic 
portions on the basis of single-letter frequency cons iderat 30 ns, he could 
spend a great deal of tune without success— probably without any idea of 
what was causing his difficulties. 

e. A. pseudo-trigraphic cipher involving a table such as that in 
Fig. 52 may be readLly recognized as such, since two letters of each tri- 
graph enciphered by means of such a table are treated monoalphabet ically. 
If three separate uniliterai frequency distributions are made — one for 
each of the three letters of the cipher trigraphs — two of the distri- 
butions should be monoalphabetic. Then, exploiting the monoalphabet icity 
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(i.e. , the positional monoalphabeticity) thus disclosed in the cipher 
text, plain text can be fitted to the cipher on the basis of s ingle - 
letter frequency considerations; in addition, advantage may be taken of 
partial idiomorphisms , if these ldiomorphisms involve the particular 
positions of the tr] graphs which have been treated monoalphabet ically. 

f . Fortunately, it is unlikely that trigraphic systems other than 
the foiegoing pseudo -t r lgr aph i c type will be encountered, because they 
are difficult to manipulate without extensive tables or complicated rules 
for encryption.^ 1 The subject can be passed over with the simple state- 
ment that their analysis requires much text to permit of solution by the 
frequency method, — and blood, sweat, and tears 

73 « Further remarks on polygraphic substitution systems . — a. In the 
treatment of the cryptography of the various digraphic systems in this 
Section, the rules for encryption and decryption which have been illus- 
trated are the "standard" nii 0 s (i.e., the rules extant in cryptologic 
literature, or the rules iropI coranonly encountered m operations! prac- 
tice) . Needless to say, however, there Is no cryptologic counterpart of 
the Geneva Convention making the Be rules sacrosanct, nor forbidding the 
use of other rules for enciphering and deciphering. 

b. In two-square systems and Playfair systems there are possible 
(and, jn fact, there have been encountered in operational practice) modi- 
fications of the usual enciphering and deciphering rules which, if not 
suspected, may pose difficulties jn the identification of such systems 
and in their cryptanalysis. For example, in a vertical two-square system, 
when two plaintext letters fall in the same column, their cipher equiva- 
lents might be taken as the letters immediately to the right of or im- 
mediately below these plaintext letters. Similarly, in a horizontal two- 
square system, if two plaintext letters are in the same row, their cipher 
equivalents might be taken as those immediately below, or to the right of 
these letters. In Playfair cipher systems, two plaintext letters in the 
same row might be represented by the letters immediately below, two 
plaintext letters in the same column might be represented by the letters 
immediately to the right; a plaintext doublet might be represented by a 
ciphertext doublet fozmed by doubling the letter immediately to the 
right, or below, or diagonally to the right and below, thus removing one 
of the identifying ciphertext characteristics of the normal Playfair 
system . In one case encountered, Instead of the normal Playfair linear 
relationship AB p iBC c , the rule was changed to ABp=CB c (thus allowing a 



m 

However, see in this connection Appendix 8, "Lester S. Hill alge- 
braic encipherment", which gives a mathematical treatment of true poly- 
graphic encipherment for polygraphs of any size. (See also subpor. 73h). 
1(2 “ ' ' 

If a trigraphic system is encountered in operational cryptanalysis, 

special solutions would be made possible by the application of cribs, the 
aid furnished by isologs (not only in the same system, but also between 
systems), etc. 
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letter to "represent itself" — an "impossibility" in Playfair encipher- 
ment), even this simple modification caused diff iculties in cryptanalysis 
because variant rules for encryption nad not been considered 



c The placing of cribs in small -matrix digraphic systems may be 
guided by the cryptographic peculiarities of these systems, when the 
general system is known to, or suspected by the cryptanalyst, conversely, 
the placing of a known crib may assist in the determination of the type 
of cryptosystem, or in the rejection of other types of systems For 
example, cribs may be placed in Playfair ciphers on the basis of the 
"non-crashing" feature of the normal Playfair, that is, on the basis 
that in the equation 1 2=3 ^ neither 1 and 3 nor 2 and k can be identical 
In horizontal two-square systems, if = j*p, then «i(3 c must equal p«tp, 

and if «*P C = (*__ p, then *».(3 C must equal pet p If, by placing a known crib 

in a cryptogram, evidence of non-rcciprocity is disclosed (e g , if 
ABp=CD c , but CDp=XYc) , the ciyptogram may he assumed to be other than a 
vertical two-square cipher, since vertical two-square encipherment yields 
complete leciprocity In eithei type of two-square system, if one of the 
two squares is known (for example, a vertical two-square might be employed 
in which the upper square is always a normal alphabet), the placement of 
cribs is materially facilitated 

d The $ test performed separately on the initial letter and final 
letter of ciphertext pairs from cryptograms produced by small -matrix di- 
graphic systems will give results neither close to that expected for 
plain text, nor close to that for landom text The reason for the com- 
parative "roughness" or pronounced differences among the relative fre- 
quencies in these distributions, as contrasted with the "smoothness" 
expected of random, is that small -matrix digraphic systems are only 
partially digraphic in natuie and that the encryption Involves charactei- 
istics similar to those of monoalphabet ic substitution with variants 
Tills roughness of the uniliteral frequency distributions for the prefixes 
and suffixes, and, for that matter, for the over-all cipher text, re- 
flects the partially digraphic nature of the encipherment 



£ If the cipher letters V, W, X, Y, and Z are of very low fre- 
quency in the over-all uniliteial frequency distribution of a digraphic 
ciyptogram or set of cryptograms, this may he taken as evidence that the 
cryptosystem Is a small -matrix digraphic system employing keyword-mixed 
sequences in the matrix or matrices Furthermore , in small-matnx 
systems involving keyword-mixed squares, if 0^ of 08 c is one of the 

letters lA/XYZ, the 6^ of the corresponding 00^ is likely to be one of 
these same letteis 



run liar ly, if ©| 



is one of the letters VWXYZ, then 



8p of the corresponding 00p is likely to t be one of these letters 



f In tnnome -digraphic systems employing large tables, the tri- 
nomes may run from 001 to 676 , as m Fig 51, or any consecutive set of 
676 tunones in the scale of 1000 possible tnnomes may be used For 
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that mailer, the enlire spon of trinomes 000-999 might he mod an such a 
table, with occaoioual gnpo, to hide the limitation:? of this system As 
another means of disguising tie limitation of 676 tr monies in such a 
system, three of the initial djgits of the tr monies might have one vari- 
ant each — thus no limitation would exist in the first po r ltion of tn- 
nomes The 001, or other starting point m the cyclic scale, need not be 
at the upper left-hand corner of the table The Cj6 ti monies m ruch 
-fables may be inscribed in straight horizontal s (i e-, m the normal 
manner of wilting) as in Fig 51, or they might he inscribed according 
to some other route, they probably would not be inscribed in a rardom 
manner because clumsy "deciphering tables'' would then be necessary It 
is also possible that the trinomes in 0 trmone-digraphic system might be 
converted into tetranomes by the addition of o sum-check (to assist m 
ei ror -correct 3 on) 

£ The cryptanalysis of tetranome-tngraphic systems with matrices 
similar to that illustrated in Fig 59 involves a modification of the 
technique used in solving inverse four-square systems If the plain- 
component and ciplvu -component sections of the large square have been 
inscribed according to the normal manner of writing (or any other manner, 
if known ) , the first two elements of tne trigraphs may bo reduced to a 
pair of cipher alphabets, and these two monoolphabetic substitutions may 
be solved as indicated in subpar 69e The applicability of inverse 
four-square solution principles to this tetranomo-tri graphic system of 
course rests on the fact that the ciphertext sections are known or as- 
sumed to contain the dinoncs 00-99 in numerical o’-dcr, inscribed m the 
normal manner of writing , the convers ion of the f irst two elements of the 
trigraphs depends upon the knowledge of the manner of inscription of the 
letters of the plain component sections, m order that me four occur - 
rences of the initial letteis and the four occurrences of the final 
letters may be conectly combined into two nonoalphabet ic distributions 
Of course, if the composition of the smal3 square (for the thirf 3 element 
of tngrsphs) is known, the third letter of tngrapns may be automa- 
tically deciphered If the composition of the small square is not l rin om, 
a consideration of the frequencies of the converted dinomes for the noil 
square (1 e , the coordinates of the square to indicate the third m nor 
of trigraphs) may be used to obtain on entering wedge into this thud 
monoalphabet ic substitution 

h There are but a very limited nurnoer of known cipher me cl an isms 
which employ the polygraph.: c cncipnenrent principle m any form U.S 
Patent Wo 1515680 i.sstced to A Henkels in 1924 and U S Potent Wo 
184591)7 issued to Weisner and Hill m 1932 describe two such mechanisms 
which produce poD ygraphic sdbstitutio The latter, tnal of Veisner and 
Hill, is of porticular mteiest because it is based on a rather simple 
mathematical process which can yield true polygraphic encipnerwent for 
polygraphs of any size The underlying mathematical process, ±r /ented 
oy Pi of Lestei S Hill of Hunter College ard described in the 'Wcrican 
Mathematical Monthly" m 1929 (Vol XXXVI, p 306) and 1931 (Vol 
XXXVIII, p 135), is treated briefly, oelow, a more detailed treatment 
is contained m Appendix 8, "Lester S Hill algebraic encipherment", 
which also includes remarks on the cryptanalysis of tms method of cn- 
cipheimcnt 
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(1) Snnce Professor Hill's system is mathematical m nature, the 
first step in its use involves the conversion of the plaintext letters 
into numbers "by means of a conversion alphabet which shows a corres- 
pondence "between the 26 letters of the alphabet and the 26 numbers from 
0 to 25 , such as the following* 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 
0 9 3 5 24 6 18 8 11 1 21 14 15 12 4 10 25 17 7 19 20 2 22 16 23 13 

(2) The numbers obtained through the conversion of the plaintext 
letters are next treated arithmetically through the application of alge- 
braic linear functions, this treatment being performed by means of mod 26 
arithmetic ^3 The numerical results yielded by the algebraic treatment 
are then converted back into letters by means of the conversion alphabet-, 
to yield the cipher equivalent of the original plain text 

(3) For example, suppose that the message "NOTHING TO REPORT" is to 
be enciphered by trigraphs, and that, for this purpose, the enciphering 
keys 1 * *" are 1, 2, 1, 5, H, 3, 2, 4, 13 The message would be divided 
into trigraphs NOT-HIN-GTO-REP-ORT and the letters which result from the 
following operation would be taken as the cipher equivalent of the first 
trigraph. 

Using the conversion alphabet in (l), above, (N 0 T) is converted 
into (12 4 19), then the foregoing keys are applied — 

lx 12+ 2x4 + 1x19 = 12+8 + 19 s 13+1(26) = Z 

5X12 + 11X4 + 3X19 = 8 + 18+5 « 5 + 1(26) = D 

2X12 + 4x4 + 13X19 = 24 + 16 + 13 = 1+2(26) = J 
Thus, NOTp is enciphered as ZDJ C 

(4) A large number of sets of enciphering and deciphering keys can 
be constructed It is even possible to construct keys which yield re- 
ciprocal encipherment, and it is this possibility which makes practicable 
the construction of a machine or device to accomplish the enciphering and 
deciphering 



Tl Vo 1 1 n n 1 

J Using "mod 26 arithmetic", one considers as the sum or product of 
two numbeis, the number from 0-25 which is obtained by subtracting 26 
(or a multiple of 26 ) from the ordinary arithmetical sum or product of 
the numbers 
44 

Encipherment of polygraphs containing n letters requnes the use of 
n 2 keys Thus, 9 keys are necessaiy ior trigraphic encipherment, di- 
giaphic encipheimerrc requires only 4 keys, whereas tetragraphic and 
pentagraphic encipherment necessitate the use of 16 and 25 keys, respec- 
tively The numbers selected for use as keys mu%t be chosen according 
tp rathei definite lules based on tne "theory of determinants", other- 
wise, cryptographic smbiguity may result when decipherment is attempted 
Appendix 8 contains more on this matter 
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i. Attention is called here to the applications of Table 13 ("Four- 
square individual frequencies") of Appendix 2j this table has been re- 
produced here for convenience. If the cryptanalyst has at hand a fairly 

(Table 13 , Appendix 2) 

(Tlnm il on a count of 6,000 dlgrnplin] 
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large volume of cipher digraphs produced by encipherment with a normal 
four-square, he may use Table 13 as on aid in placing the initial letters 
and final letters of the cipher digraphs Into the appropriate cell? of the 
cipher component sections on the basis of their uniliteral frequencies . 

Thus, if a distribution made of the initial letters of cipher pair , m a 
particular example shows Qq, I c , and C c to be the letters of predominantly 
high frequency (listed in descending order of frequency), and if the 
distribution of the final letters shows F c , Qc, and P c as the letters of pre - 
dominantly high frequency (in descending order of fr.quency), these 
letters may be tentatively placed into a skeleton four-square matrix as 
follows (Fig 70 ) t based on the locations of the highest frequencies as 
given in Table 13 
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Figure 70 



In attempting to diagnose the underlying cryptosystem in any 
particular polygraphic cipher, the student may gain some assistance from 
the following recapitulation 

(1) In digraphic ciphers the majority of repetitions will he an 
even number of letters apart and these repetitions should for the most 
part begin on the first letters of pairs and end on the last letters of 
pairs The majority of repetitions in trigraphic ciphers will he some 
multiple of three letters apart and these repetitions should for the most 
part begin on the first letters of fcrigraphs and end on the last letters 
of trigraphs, 

( 2 ) Digraphic cipheis may he revealed as such by the digraphlc phi 
test, with additional support being given by the digraphic blanlc- 
expectation test, the presence of a null letter at the beginning of the 
cipher text might he disclosed by applying the two foregoing tests to a 
distribution of the digraphs which are formed when the first letter of 
the texb is omitted 

( 3 ) If either the uniliteral frequency distribution for the initial 
letters or for the final letteis of the digraphs in a cryptogram exhibits 
monoalphabeticity, the cryptogram is probably a pseudo -digraphic cipher 
involving a large table of the type in Fig 47 or 48 If both of the 
foregoing uniliteral frequency distributions reflect monoalphabeticity, 
the cryptogram may involve the use of a table of the type in Fig 69 

(4) If the "decipherment" of a cryptogram by means of a four-square 
matrix containing four normal alphabets yields two' monoalphabetic substi- 
tutions — one for the initial letters and one for the final letters of the 
pseudo-decipherment--the cryptogram may be assumed to be an inverse four- 
square cipher. 

(5) If sin ocular inspection or statistical evaluation of the cipher 
text of a cryptogram reveals a large number of "transparencies", the 
cryptogram probably involves a two-square system 
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(6) If a cryptogram contains several cipher doublets, all of which 
are bioken up when the cipher text is divided into digraphs, the crypto- 
gram may well involve normal Playfajr encipherment 

(7) If the cipher text of a cryptogram exhibits any invariable af- 
finity of one of the letters J, K, Q, X, or Z for vowels (or, for that 
matter, another cluster of 5 or 6 letters), the cryptogram probably is 
in a small -matrix system employing sections consisting of moie than 25 
letters 

k If a particular four-square cryptogram involves the use of a 
matrix in which either the plain component sections or the cipher com- 
ponent sections are normal alphabets, the matrix will be recovered 
through cryptanalysis in its original form, even when the components 
which aie maxed have been derived by a transposition method or by no 
method at all In Playfair cipher solution, the matrix can be recovered 
in its original form as long as the original matrix has been mixed m 
some systematic manner However, m the case of two-square solution, 
there is no guarantee that the matrix can be recovered in its original 
form unless the original matrix has been keyword -mixed, if the original 
has been transposition-mixed, for example, the matrix which has been 
recovered through cryptanalysis — while being cryptographically equivalent 
to the original — will undoubtedly involve a permutation of the rows and 
columns of the original 

1 When four-square systems are encountered m which the matrix 
consists of four differently-mixed sections, reconstruction of the matrix 
is accomplished in a manner similar to that used in the analysis of two- 
square cipheis If the sections are composed of keyword-mixed sequences, 
the original mati ix may be recovered Otherwise, the reconstructed 
matrix will m all probability be a permutation of both Urn rows and the 
columns of the original matrix, and there may be no way of lecovenng or 
or proving the original matrix 

m In passing, it might be well to mention that an/ two-square 
system can be solved as a four-square system in which tli** matrix 13 
composed of four mixed sections, upon the realization, figure phenomena fin 
the matrix reconstruction, that a two-square matrix is involved, the 
proper conversion ca r > then easily be made 
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SECTION X 

CRYPTOSYSTEMS EMPLOYING IRREGULAR -LENGTH CIPHERTEXT UNITS 
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74. Preliminary observations . --a. The cipher alphabets of nearly all 
of the various cryptosystems treated~thus far in this text have involved 
cipher units of a constant length. 1 That is, the ciphertext units have 
been (prior to regrouping into fives for transmission) either single charac- 
ters, or pairs of characters, or three -character groupings, or, in the case 
of the Baconian and Baudot alphabets, 5 -element ciphertext units; however, 
within a given cryptosystem the lengths of the ciphertext units have been 
consistent , and It is this consistency that has been of most importance to 
the cryptanalyst. 

b. There is no reason why a cryptographer could not vary the size of 
the cipher units in a particular cryptosystem, as long as no cryptographic 
ambiguity in deciphering would resu3 fc thereby. Furthermore, if the 
size of the cryptographic units is varied within a particular cryptosys- 
tem, obstacles are put in the way of cryptanalytic attack on the system — 
varying the length of the ciphertext groupings complicates the crypta- 
nalyst’s preliminary task of dividing the cipher text into the proper 
units for study. In this connection, the student should refer back to 
par. 63 and read again the remarks on the use of nulls which differ in 
size from the real cryptographic units . The example contained therein 
makes it clear that, until such nulls are identified and isolated by the 
cryptanalyst, he Is unable to divide the cipher text properly and make 
appropriate frequency distributions . However, nulls may sometimes be rec- 
ognized as such because they do not behave like units which, represent 
actual plaintext elements . For example, in the three almost -identical 
ciphertext passages below, 

(a) . . .181^5 11343 71129 3219# 23231 52937... 

(b) ...18151 01343 71129 32192 32^31 52937... 

(c) ...18151 13437 1^129 32192 30231 52937... 



The only exceptions have been in the digraphic systems using the 
matrices illustrated in Figs . 57a and 57b , in which a plaintext digraph 
may be represented by a ciphertext digraph, trigraph, or tetragraph, 
depending upon the identity of the plaintext digraph. 
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the "behavior of the digit 0 is characteristic of a null, and when this is 
recognized and eliminated, the remaining cryptographic text may he broken 
up into its real units and solved quite readily. 

c. Since it has been indicated above that there are weaknesses in 
a scheme in which all cipher elements do not behave like equivalents for 
plaintext elements, it would be logical then to devise a system in which 
different -sized ciphertext units all represent actual plaintext elements 
and thus do behave more or less alike . It is easy to draw up cipher alph- 
abets in which, for example, some of the letters are represented by single 
digits , others by pairs of digits . Such a system, called a monome-dinome 
system?, would produce cipher text which is an irregular intermixture of 
uniliteral and multilitcral equivalents. From the cryptanalytic standpoint, 
the decomposition of such cipher text could he very difficult for the an- 
alyst who does not knot? which digits to treat separately, which in pairs . 
Such systems, and similar variations, are given detailed treatment in the 
following paragraph. 

75. Types of alphabets with irregular -length ciphertext units .— a. One 
simple scheme for yielding single -digit equivalents for some letters and 
two-digit equivalents for others makes use of a rectangular matrix which 
Is similar to some of the biliteral matrices of Sections VII and VTII, 
but which differs in that the top row of the matrix has no indicator (or 
coordinate). For examples, see Figs* 71-7^, "below. Each plaintext char- 



0183^52976 0183^52976 

m , r . r — r -. - ■ ■■. ■ ■ . ■ 
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Figure 71. 



Figure 72. 
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Figure 73. Figure ’jk . 



acter appearing in the top row in the matrix has as its cipher equivalent 
merely the monome which appears above it, among the coIuji coordinates ; 
thus. In Fig. 71, Ep = 0c, Tp » lc, Np = 8 C , etc. Each plaintext character 



2 See in this connection Foote, Alexander, Handbook for Spies , Hew York, 
19^9 » pp. 250-256, wherein is described such a cryptosystem reputedly 
typical of those used by secret agents in World War II. 
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appearing In one of the remaining rows has as its cipher equivalent the 
dinome formed by its row coordinate and column coordinate, respectively; 
thus in Fig, 71 , Gp = jh Q , Qp * 6 l c , etc. 



b. It should be noted that the external construction of all of the 
foregoing matrices is such that any digit which appears as a row coordi- 
nate does not occur as the monome equivalent for any letter; this limita- 
tion, accomplished by blanking out appropriate cells in the top row, is 
necessary in all monome-dinome systems in order that cryptographic ambi- 
guity will not arise. In Fig. 71, the internal composition is such that 
the plaintext letters which are most frequent In English are the ones 
which are provJ ded with monome equivalents . This type of arrangement 
theoretically provides the most economical encryption for any given mes- 
sage — that is, theoretically yields the shortest possible cipher text for a 
given plain text— but, of course, greatly limits the number of internal 
arrangements which may be used. Fig. 75, below, which is split into two 
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6 



5 2 9 7 6 
ABCDF 
G H I J K 
L II P Q S 
U V W X Y 
Z . ( ) » 



0 1 8 3 4 

ETHHO 



Figure 75 a. Figure 75b. 

separate parts — one providing the monome equivalents and the other provi- 
ding the dinome equivalents — illustrates another scheme for drawing up a 
monome-dinome cipher alphabet. In this alphabet, the digits which are 
used for the Initial and final elements of dinomes are completely distinct 
from the digits used as monome s . 



c. Most of the foregoing matrices contain a period for punctuation, 
and the matrix in Fig. 72, containing the single digits 0-9, provides a 
means for encrypting numbers without first spelling them out. The matri- 
ces in Figs. 71, 73, and 75 contain another character, symbolized by an 
asterisk, which may be used for punctuation or as a special inaicator3. 
The matrix in Fig. 7^ uses only nine of the single digits as coordinates, 
the digit 6 being omitted; this single digit might be employed as a word 
separator, a stop, or a null. The matrix in Fig. 76 , below, illustrates 



9 

For example, this special character may be put to use as an indica- 
tor to show that plaintext numbers begin or end, thus obviating the 
necessity of Including digits within the cipher matrix. In this usage 
digits in the plain text might be tripled and inserted in the cipher text 
with the appropriate indicator before and after the plaintext digits. 
Thus, using the matrix in Fig. 71, the plaintext fragment” . .HILL 865 .#” 
would be encrypted as the cipher sequence 75 2 77 77 66 888 666 555 66 
(prior to regrouping into five -character groups) . 
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may be represented in the matrix, as well as the single letters and 
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digits. The symbol # in this latter matrix could be used as aVrepeti- 
tion indicator" for checking numbers, as in the ciphertext passage 
69 65 68 76 69 65 68 , meaning the number 752 , the symbol * might be used 
as an indicator meaning "the immediately-preceding plaintext letter is 
repeated" (thus AA patterns would be suppressed in the cipher text) * In 
all of the foregoing matrices the order of inscription of the letters 
within the matrix, and the particular arrangement of the row- and column- 
coordinates are both subject to variation. 

d. By piearranged convention it is possible to employ ordinary commu- 
tative bipartite matrices (such as those already described in Sections VII 
and VIII ) In a manner which yields monome-dinome encipherment. For example, 
using the matrix illustrated in Fig. 77, the plaintext word EIGHT could be 



1 
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4 

5 




Figure 77. 



encrypted as 10 29 7 8 49. That is, the normal bipartite enciphering con- 
ventions would be used, with the exception that the row indicator in the 
cipher equivalent for a particular plaintext letter would not be employed 
when this row Indicator is the same as that for the immediately preceding 
letter of the plain text .4 As may be noted, no cryptographic ambiguity in 
decipherment may arise. 



Of course, as an extension of the foregoing Ideas, there could 
also be monome -dinoxce -tri nome systems, incorporating matrices of the types 
illustrated in Figs'." 78 - 82 , below. In Fig. 83 there is a matrix which may 



A variation of this method could make use of a convention by which 
the column indicator is dropped if It is the same as that for the piecedlng 
plaintext letter. 



?30 




REF ID : A56892 



CONFIDENTIAL 



a l o 3 4 0J_ 5 3 4 0183^5 





STOKE 


0 


a bT5T 


- 


BRANCH 


5 


ABCDF 


1 


a H I K L 


2 


DEFGIJ 


P 


6 BIKL 


8 


MPQSO 


52976 9 


KLMOPQ 


<? 


M F 0 R U 


3 


V W X Y Z 


TENOR 7 


S T U*V W X 


7 


i/wxyz 


45 


0 12 3 4 


29 


Y Z 0 1 2 3 


60 


0 12 3’+ 


42 


5 6 7 8 9 


27 


456789 


61 


5 6 7 8 9 











Figure 78. 



Figure 79a. Figure 79b. 



Figure 80. 
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Figure 8l. Figure 82. Figure 83. 



■be used for dinome -trinoroe enc iphermerrfc . Encipherment vith this latter 
matrix is commutative; for example , Ep ■ 24 or 42, and Tp ■ 621 or 162 . 



f . Literal versions of the preceding types of alphabets vith ir- 
regular-length cipher units are also possible. Several types are illus- 
trated in Figs , 84-88, including among them matrices permitting the use 
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Figure 87 a. 



Figure 87 b. 



of variants in encryption. Furthermore, any of the commutative variant 
matrices treated in par. 58 a (l.e., Figs. 27, 28, and 31) maybe used in 
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connection with the convention described in subpar. &, above, to provide 
cipher alphabets with irregular -length ciphertext units. 

76. General remarks on analysis . — a. The first step in the analysis 
of any cryptogram encrypted in a system with in egular-length cipher 
groupings involves dividing the cryptogram into the proper, vari-sized 
cipher units — that is, reducing the cryptogram to monoalphabetic terms. 

After this has been done, solution proceeds along the straightforward 
lines which have been described in earlier sections of the text. Thus, 
in this section, attention will be focused on this first step of dividing 
the text into its pioper monoalphabetic units. In order to simplify some- 
what the general treatment contained in this paragraph, all remarks will 
he directed at monome-dinome systems, most of the principles and methods 
outlined herein are general enough that they may he modified end applied 
in the solution of other types of systems with irregular-length ciphertext 
units . 

b. A cryptographer, in his process of deciphering a particular 
monome-dinome cryptogram, would begin by considering whether or not the 
first digit of the cipher text were among those digits which can start 
a dinome — that is, whether it were a row coordinate or not. If it were, 
he would treat it along with the next digit of the text as a diDome, 
and then proceed to consider whether or not the following digit were a 
row coordinate, etc. If the first digit of the message were not a row 
coordinate, he would treat it as a monome, and then proceed to consider 
whether or not the next digit were a row coordinate, etc. One may now 
see that the cryptographic process of dividing the cipher text into its 
propel units in based solely on a knowledge of the digits which are the 
row coordinates of the pertinent matrix. Thus, it may further be seen 
that the cryptanalytie attack on a monome -dinorce cipher would first 
involve an attempt to determine the identity of the row coordinates . 

£. If a given cryptogram involves a matrix in which the high- 
frequency plaintext elements are evenly distributed throughout the various 
rows, it may be expected that the particular digits occurring with the 
greatest frequency in a unlliteral frequency distribution made on the cipher 
text are those which are row coordinates of the pertinent matrix. This 
may be explained by the fact that the digits used as row coordinates occur 
in the cipher equivalents for more plaintext letters than do those digits 
which are used as monomes . However, one must remember that a monome- 
dinome matrix may involve two, three, four, or more row coordinates and, 
although in a particular instance it may be that the most frequent cipher 
digits are those digits which have been used as row coordinates, a study 
of the uniliteral frequency distribution may not make io cbvious as to 
just how many coordinates are involved, it may be necessary to make 
several trials, one considering only the two most frequent cipher digits 
as row coordinates, one considering the three most frequent, etc. 

d. If trials of the type just mentioned do not yield reduced, mono- 
alphabetic text which will succumb to the principles of plaintext recovery 
treated in the earlier sections of this text, it ray then be assumed that 
the cryptogram involves a matrix in which several of the high-frequency 
letters are arranged together in the top row or in which one or more columns 
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are composed »o3ely of high-frequency letters. Such matrices are likely 
to produce cipher text in which some of the digits which have "been used 
as monomes occur more frequently than some of those used as row coordi- 
nates, Thus, the easy mode of entry via the uniliteral frequency dis- 
tribution may not he used, and other approaches of a less clear-cut 
nature must he taken. 

e. In an attempt to identify at least one or two probable coordi- 
nates7 the analyst should carefully scrutinize the cryptogram itself in 
order to find passages exhibiting bipartite characteristics , such as ap- 
pear in the sequence 80438 I 87 IU, wherein ’the" digits' 8 and 4 "act" like 
digits which have been used as row coordinates’, being spaced off at in- 
tervals of two. A slightly more objective approach involves first making 
a biliteral? distribution of the cipher text, and then considering as a 
probable row coordinate the Initial digit of the particular dinome which 
the distribution shows to be the most frequent. Of course, this ap- 
proach is mosb likely to be valid when the particular dinome occurs with 
a much greater frequency than the remaining dlnomes . While still on the 
subject of distributions, it is pointed out that the previously-mentioned 
"bipartite characteristics" manifested in a cryptogram might be disclosed 
by making a bl literal distribution of alternate digits of the cipher 
text®, that is, in the sequence 123456" one would consider the dlnomes 13, 
24, 35, 46. In such a distribution, one may expect that the most frequent 
dinomes will be those comprising two digits which were both row coordinates 
of the pertinent enciphering matrix. 

f . If the cipher texb of a given monome -dinome cryptogram begins 
with a doubled digit, this digit is most probably one of the row coordi- 
nates of the pertinent matrix; otherwise, the doublet would have to be 
considered as comprising two monomes and the first word of the underlying 
plain text wouJd have to begin with a doublet (a very rare contingency 

in the English language) . Similarly, if the cipher text is seen to contain 
any digit repeated consecutively four or more times, the particular digit 
may be assumed to be a row coordinate; otherwise, such a sequence of re- 
peated digits would have to represent at least a threefold repetition 
of some one plaintext better (another rare event in English, although not 
as rare as that mentioned in the preceding sentence). 



5 The use of the term "b iliteral " in connection with digit cipher text 
may not be in conformance with the strictest rules of semantics, but the 
author feels that it is unnecessary to give a new name to an already- 
familiar type of distribution merely because it is being applied to a 
different kind of text. However, some who prefer to be purists in this 
matter term a digraphic distribution which is made on digit text as a 
"dinome distribution" or "dinomic distribution", and a biliteral distri- 
bution made on digit text, a "running dinome distribution". 

^ In the vernacular such a distribution is termed an "A-A" (pronounced 
"ay-dit-ay") distribution. 
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g. On occasion it may be found that much time has been spent in the 
attempt to identify the row coordinates, yet apparently with nob all of 
the coordinates being identified. In such a case, it may be found useful 
to consider those digits which are least likely to be row coordinates, 
specifically, those which occur least frequently m the cryptogram. The 
analyst may go through the troublesome cryptogram and place a slant bar 
(virgule) directly after each such digit as it occurs in the message. 
These marks may then be taken as an indication of places m the crypto- 
gram where one bona fide cipher unit ends and the next begins. The ana- 
lyst must then study the digits which directly follow these slant bars 
with a view to discovering new possibilities for row coordinates — possi- 
bilities which, although previously latent, have been made patent by this 
latest step. 

h. In the foregoing subparagraphs, a to g, the secondary step of 
testing for the corroboration or invalidation of any particular trial 
decomposition has been passed over quite briefly. Actually, this step 
is best described with specific examples of solution, and for this 
reason is treated in two subsequent paragraphs, 77 and 78 , with such 
examples. However, a few methods which can be applied for the rejection 
of incorrect hypotheses will be mentioned here, because they are rather 
basic and simple. If the cryptanalyst finds, after having divided a 
monome -dinome cipher on the basis of a particular hypothesis, that a long 
repetition in the cryptogram is not broken up in the same way on each of 
Its occurrences, he may well reject as incorrect the hypothesis on which 
the division is based. Likewise, the analyst may reject any hypothesis 
which requires him to make the last digit of a cryptogram a monome when 
this particular digit has to be considered as a row coordinate as part of 
the basic assumption .7 The presence of an inordinate number of consecu - 
tive monomes may cause one to suspect that a particular decomposition is 
Incorrect; however, probably only continued exposure to traffic of a 
certain type or involving one kind of enciphering matrix would provide 
pne with a sound basis for knowing Just how many are too many . 

i. There is one practical, straightforward measure for determining 
the relative goodness of an assumed decomposition which deserves particu- 
lar mention. It involves considering the ratio of the number of monomes 
produced in a particular decomposition to the number of remaining cipher 
units. In the case of monome -dinome ciphers, for example, 3n wnich an 
assumption of only two row coordinates is made, there can be no more than 
eight different plaintext letters represented by monomes and the total 
frequency of those monomes can not exceed the frequency expected of the 
eight most frequent letters in the language .8 Since in English the eight 
most frequent letters occur with a total relative frequency of 66$>, any 
trial decomposition giving rise to a ratio of monomes to dinomes which is 



7 

However, the possibility of a final null or nulls must not be ig- 
nored; the presence of nulls at the end of the cipher text would invalidate 
this reasoning . 

o 

0 The only exception to this statement would be a case wherein a word 
separator is included as part of the cryptosystem, and that this separator 
is renrcsented by a monome. This usage, however, seems rather unlikely. 
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considerably more than 66 to 3 ! i (» 1*9) Biay "be considered incorrect. 
Likewise, since an assumption of three row coordinates limits to seven 
the number of different, plaintext letters which may have monome equiva- 
lents and since the seven most frequent letters in English occur with a 
total relative frequency of 60$, any such assumption giving rise to a 
ratio of monomes to dinomes which is considerably more than 60 to 40 
(» 1.5) may be considered invalid. The author does, however, hasten to 
point out that a ratio which is smaller in any instance than the pertinent 
ratio, above, does not disprove the particular trial decomposition since 
the plaintext letters represented by monomes may not necessarily be the 
letters of highest frequency. The examples in the next two paragraphs 
will serve to clarify the foregoing considerations. 



77 « Analysis of simple examples . — a. The following cryptogram, 
suspected to be a monorae-dinome cipher ,”*18 available for study: 



10 



15 



20 



25 



30 



A 24090 15709 08 1 21 02092 92405 56001 
B 27072 90482 47607 09022 10209 29724 
c 07292 93257 52961 09042 72002 07247 
D 50570 96081 72409 29040 40971 24097 
E 29128 76090 40750 65297 09067 20902 



F 



09040 74076 



Cursory examination of the cipher text reveals nothing more significant 
than the fact that the digit 3 is absent; however, the significance of 
this escapes us for the moment. A uniliteral frequency distribution of 
the text is then made, as is illustrated below: 



s j? 

% — er 
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1234567890 



b. The uniliteral frequency distribution shows four marked peaks 
(2,7,9, and 0) and one pronounced trough (8). A biliteral frequency 
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distribution is made, as shows below, to assist in further evaluation of 
the properties of the cipher text. It is noted that the 2 and # rows. 
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0 



representing the two highest-frequency digits in the cipher text, have 
the most liberal combinations with the remaining digits; this would indi- 
cate that 2 and # are likely row coordinates of the cipher matrix. Since 
the 7 and 9 rows show less affinity of these digits for other digits, 7 
and 9 are less likely to represent row coordinates of the matrix; conse- 
quently the assumption is made that the matrix involved only two numbered 
row coordinates, 2 and #. 

c. The cryptogram is now divided accordingly, and the assumption of 
2 and"# as row coordinates is borne out hy the bipartite character of the 
following passages in the cipher text: 

( 1 ) . . ./ 21 / 02 / 09 / 29 M/ 05 / . . . (at All*) 

(2) * . ./07/09/02/21/02/09/29/ ... (at Bl4) 

(3) .../ 09 /oy 27/20/02/07/24/... ( at ci 6 ) 

(4) .../24/09/29/04/04/09/... (at D12) 

A frequency distribution of the decomposed text is made, as illustrated 
below: 



2 

0 

The percentage of monomes, 35$> does not exceed the threshold for the 
sum of the frequencies of the eight highest-frequency plaintext letters; 
furthermore, since the eight monomes have a much lower frequency than the 
sum of the eight highest-frequency letters in English, this is an indi- 
cation that some of the monomes represent plaintext letters of lower 
frequency. 

d. The decomposed text may now be solved, and the message is found 
to begin with the words "SABOTAGE PLAITS.,," The original matrix is recon- 
structed, and is discovered to be based upon the key word VEKIOUTH, as 
follows : 
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916 li 537302 

-fVERMOUTffr: 

0ABCDFGJJKL 
2 I ff F.Q S W X Y Z . 4 , 

She reason for the absence of the digit 3 In the cipher text may now he 
seen: the digit 3 forms a part of only the letters H, J, and Z, and 

these letters did not occur in the plaintext message. 

e. Solution of certain other cases of mixed-length systems pro- 
gresses as easily as did the solution of the foregoing example. 

(1) For instance, in the case of a cryptogram produced by a matrix 
with which the digits used for both the initial and final digits of di- 
nome3 are completely distinct from the monome digits (e.g., Fig. 75), it 
may he seen that ’‘eliminating” from the cipher text those particular di- 
gits which were used as monomes in the original enciphering alpnabet will 
leave the remainder of the cryptogram broken up into units all of which 
contain an even number of digits. (This would not be true In the case of 
other types of matrices, such as Figs. jl-'jk, since eliminating the digits 
which were used as monomes in the pertinent alphabet would remove not 
only actual cipher monomc3 but also the final digits of many cipher di- 
nomes.) Based on this fact, If one is confronted vith a cryptogram which 
he assumes to have been produced by a matrix such as thai in Fig. 75? he 
may use a mechanical method by means of which he will quickly be able to 
determine which digits are row coordinates and which are not, or, if his 
basic assumption concerning the type of matrix involved is incorrect, the 
error will quicKly become known lo him. He need only moke successive 
trials each of which involves considering a diffeient one of the 10 di- 
gits as being one of those which is a monome in the pertinent alphabet, 
"eliminating" the particular digit from the cryptogram, in each trial will 
inevitably lead to other digits which must also be eliminated throughout 
the cryptogram in order to maintain the stipulation that all the cipher 
units which remain must contain an even number of digits. For example , 

if one assumes that "0'* is a digit which was a monome, then he must fur- 
ther assume from a sequence of c ipher digits such as 050350 that "5” is 
also a digit which was a monome; and then likewise ”3” . Any particular 
one of the ten trials which is based on an incomect Initial assumption 
may be expected to end up with all ten digits being considered as digits 
which were monomes . 

(2) Tn the case of a monome-dlnome system in which the row coordi- 
nates of the enciphering matrix are distinct from the column coordinates 
(as in Figs. 73 and 7^)> solution is expedited by capitalizing on the 
fact that the digits within the family comprising the row coordinates do 
not (end cannot) contact themselves or any other digits within the family; 
using Fig. 73 as an example, it is obvious that the digits 7? 8? and 9 
can never be followed 'by a 7, 8, or 9. A cryptogram enciphered by such 

a system may be expected to contain much fewer cipher doublets than would 
a cryptogram, produced by a matrix without the foregoing limitation, and 
the doublets which do occur will themselves involve hut a limited number 
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of the 10 different digits. When solving such a cryptogram, the crypta- 
nalyst need only consider as possible candidates for row coordinates those 
particular digits which do not appeal in cipher doublets. Furthermore, 
he may with certainty go through the cryptogram placing a slant bar (to 
indicate the end of a valid cipher unit) after every occurrence of any 
digit; which has appeared in a cipher doublet. 

(3) If the cryptanalyst is confronted with a monome-dinome cipher 
which is the result of encipherment by means of a commutative bipartite 
matrix (see subpar. 75d and the accompanying Fig. 77 ), he knows that the 
first digit of the cryptogram must be a row coordinate. The analyst then 
has only to go through the cryptogram noting the digits which follow this 
row coordinate digit wherever it occurs in the cryptogram and, in this way, 
he may be able to identify all vhe column coordinate digits. Of course, 
by the process of elimination, he will then know which digits are row 
coordinates besides the initial digit of the cryptogram, and it will then 
be possible for him to divide the text into its proper irregular-length 
ciphertext units. 



78. Analysis of more complicated example s. — a. In sore cases, the 
rather simple methods of analysis applied in the preceding paragraph will 
not bear fruit, either because of the complexity inherent m the number of 
plaintext elements In the cipher matrix, or because of certain unpredic- 
table abberrations caused by the particular designations of the row and 
column coordinates. For instance, if a specific matrix contained only 
the highest-frequency letters in the top row, and if the matrix contained 
a fairly large number of plaintext elements (and therefore embodied 3 or 
4, or more, row coordinates), and if the elements in the dinome rows were 
balanced from the frequency standpoint, so that the rows would be used 
with approximately equal frequency, and if furthermore certain of the 
columns were composed of heavier elements than others (thus producing 
peaks that might incorrectly be identified as row coordinates) — all these 
conditions would yield a cryptosystem that might pose considerable dif- 
ficulties in the way of straightforward analysis. A case will now he 
studied that will illustrate typical techniques that would be necessary 
in more difficult circumstances. 



b. The following cryptogram has been intercepted on an enemy net 
known to be passing monome-dinome traffic: 

62719 4 4 0 8 1 21204 71270 550 4 2 12627 

09637 06212 24712 91724 21058 12727 

07055 58719 55721 04109 52847 71297 

23571 82123 94578 77571 80581 97654 



7 4 5 7 2 

69051 

5 4 7 4 2 



0 5 19 1 

1 5 7 2 4 

78271 



7 7 19 4 
7 13 8 9 
7 2 3 2 7 



5 2 9 5 8 

^7316 

05504 



7 0 0 1 2 
7 9 0 3 5 
5 8 2 5 5 



1 2 2 5 1 

4 7 3 5 9 

5 5 9 1 8 



^38 
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The uniliteral frequency distribution for the cryptogram is shown below: 



gfiir ar I 

£ % % % 

5 M ^ H 

1 2 3 * 567890 

e . From the appearance of the uniliteral frequency distribution, it 
is to be expected that from among the four peaks (l, 2, 5, and 7 ) some row 
coordinates must be represented, and since there is not much variance In 
frequency among these peaks, perhaps all four represent row coordinates* 

In an attempt to obtain as much information as possible from a study of 
the frequency characteristics of the cipher text, a billteral distribution 
is made and is shown below* 



1 

2 

3 

h 

5 

6 

7 

8 

9 

0 



Examination of this latter distribution adds support to the impressions 
gained from the unilifceral frequency distribution, namely, that the row 
coordinates for the cipher matrix are very likely to he found among the 
digits 1, 2, 5, and 7* Furthermore the digit 7» because of its high fre- 
quency and because of satisfactory combinative qualities in the hillteral 
distribution, is selected as a definite row coordinate— this will reduce 

the number of trials that must subsequently be considered. 

* 

d. If all of the row coordinates of the cipher matrix are found among 
the various combinations of 7 with 1 , 2 , and 5 > then it is clear that: 

(1) if there are but two coordinates of the matrix, these must be 
either 7 and 1 , 7 and 2 , or 7 and 5 , (three cases); 

( 2 ) if there axe three coordinates of the matrix, these must he 
either 7 - 1 - 2 , 7 - 1 - 5 , or 7 - 2-5 (three cases); or 

( 3 ) if the matrix has four numbered coordinates, this must entail 
the combination of 7 - 1 - 2-5 (only one case). 



1 2 3 k 5 6 7 8 9 0 
1 il l - 123 3 5 3 
7 23321913 2l 
1 1 - - 3 - 1 l 1 ■ 
i 3*39-6521 
3 2-39-6522 
- 31-1-1-1 -i 
10 72211321 7 
3 3-1 — 3-11 
3 . . U 1 2 . - 2 
1 - 1 k 7 1112 1 
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e. On the "basis of each of the foregoing seven hypotheses, the cipher 
text is divided and the resulting frequency distributions are shown below: 



1 

it 



2 3 4 
T7tTF_ 

8 » w 

7 a g* 






6 

T 

2 

1 



6 



£ 









2 


mlm 


7 





12 3 



4 5 6 7 8 9 0 
ilia I?] 

6 2 1-1 18 13 1 
6622-13113 



Case II 



1234567890 



16---23132 

412121612- 

7522-12114 



Case IV 




Cnee V 



Case VI 



1 

2 

5 

7 




Case VXX 

f . In order to be able to evaluate the relative merits of the seven 
hypotheses and choose the case which is most likely to be correct, it is 
possible to resort to a method wherein group frequencies of the high-fre- 
quency elements from each of the deccmposl’tlo'ns' are" studied. In the fol- 
lowing table drawn up for this purpose, the column of figures under "x" 
denotes the cumulative twelve highest-frequeney ciphertext units; under 
M H", we have the actual frequencies of the first, the first two, the first 
three..., the first 12 highest-frequeney ciphertext units for each hypo- 
thesis (compare with the distributions in subpar. e); in the adjoining 
column to the right of each "N" column, the various cumulative frequency 
values are expressed as percentages of the total number of ciphertext 
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I 


II 


III 


si 

<5 3 
| 


V 


VI 


VII 


P 


A 


W 


158 


II 


IT 

761 


s 


IT 

157 


IT 


IT 

147 


J 


N 

Ip 


IT 


n 

139 


H 


N 

129 




1 


31 


19.6 


30 


18.6 


25 


15.9 


29 


19.7 


16 


11.6 


17 


12.2 


12 


9.3 


13.0 


2 


48 


30.4 


48 


29.8 


46 


29.3 


44 


29.9 


29 


21.0 


31 


22.3 


24 


18.6 


22.2 


3 


64 


45.0 


64 


39.8 


60 


38.2 


57 


38.8 


42 


30.4 


43 


30.9 


34 


26.6 


30.2 


4 


77 


48.7 


79 


49.1 


73 


46.5 


67 


45.6 


51 


37.0 


54 


38.8 


4l 


31.8 


37.8 


5 


87 


55.1 


91 


56.5 


85 


54.1 


77 


52.4 


60 


43.5 


62 


44.6 


47 


36.4 


45.3 


6 


96 


60.8 


102 


63.4 


92 


58.6 


CO 


57.1 


67 


48.6 


70 


50.4 


52 


40.3 


52.7 


7 


105 


66.5 


110 


68.3 


99 


63.1 


90 


61.2 


74 


53.6 


76 


54.7 


57 


44.2 


60.1 


8 


113 


71-5 


116 


72.0 


105 


66.9 


96 


65.3 


80 


58.0 


81 


58.3 


62 


48.1 


66.2 


9 


120 


75.9 


122 


75.8 


111 


70.7 


101 


68.7 


86 


62.3 


86 


61.9 


67 


51.9 


70.4 


10 


127 


80.4 


128 


79.5 


117 


74.5 


105 


71.4 


91 


65.9 


91 


65.5 


71 


55.0 


74.0 


11 


133 


84.2 


133 


82.6 


122 


77.7 


109 


74.1 


96 


69.6 


96 


69.1 


75 


58.1 


77. ^ 


12 


137 


86.7 


138 


85.7 


127 


80,9 


113 


76.9 


101 


73.2 


101 


72,7 

x\.'rcjsses^ 


79 


61.2 


80.5 



units which remain after the particular trial decomposition. The column 
labelled "P M gives the cumulative theoretical frequencies of the first 
12 letters in English plain text. 

g. It is noted that in Case I, the most frequent ciphertext unit 
has a percentile frequency of 19.6$, the highest two units, a percentile 
frequency of 30.4$; the highest three, a percentile frequency of 45.0$. 
When these percentages are compared with the percentile frequency of the 
highest-frequency letter in English plain text (13.0$), of the highest 
two letters (22.2$), and of the highest three letters (30.2$), it is clear 
that Case I does not conform to the characteristics expected of a simple 
monoalphahetic substitution, therefore Case I is not the correct division 
of the cipher text. Similarly, Cases II, III, and IV can also be rejec- 
ted because the cumulative values are much higher than the corresponding 
expectations for plain text. Case VII, on the other hand, demonstrates 
values much lower than the corresponding expectations for plain text; 
therefore this case too is rejected. This leaves only Cases V and VI, 
both of which show a close correspondence with plaintext expectations. 

h. If there were nothing else in the manifestations of the decom- 
posedT’cipher text in Case V and Case VI, these two cases would have to 
be tried in turn, making some tentative plaintext assumptions ) of course, 
only the correct case would consistently yield plain text. However, there 
is an additional bit of reasoning which may be applied here as a means of 
deciding which of these two remaining cases is more likely to be correct 
and ought to be worked on first — namely, it may be reasoned that cipher 
text which has been decomposed according to an incorrect hypothesis will 
be likely to contain a larger ratio of monomes to dinomes than would the 
same text if it had been decomposed according to the correct hypothesis .9 



9 This intuitive reasoning has been borne out by empirical observation. 
30 monome-dinome ciphers of an average length of 100 digits were decomposed 
in all possible ways based on the hypotheses of two, three, and four row 

coordinates. In the case of of these ciphers, the correct decomposition 

yielded a monome -to -dinome ratio which was lower than the monome-to-dinome 
ratio yielded by any of the incorrect decompositions . 
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Case V has a monome -dlnome ratio of .916 whereas Case VI has a correspon- 
ding ratio of 1.04-3; thus Case V is indicated as the case which is more 
likely to he correct. 

jL. The cipher text is now divided according to the hypothesis of 
row coordinates of 1, 5, and 7, and the plain text is quickly recovered, 
facilitated hy the pattern of the first word, RECONNAISSANCE • The cipher 
matrix is reconstructed as follows; 




The reason for the high frequency of the cipher digit 2 is now seen; the 
combined frequencies of Ep, S p , and U p contribute to an inordinate peak 
for that column coordinate. 

In retrospect, several important points may be noted in the so- 
lution of this particular cryptogram. First of all, the four consecutive 
5’s in the last two groups of the cryptogram make it a very strong proba- 
bility that 5 is a row coordinate, otherwise the four 5's would mean a 
threefold (or even fourfold) repetition of a monome letter, a compara- 
tively rare contingency. Secondly, the digit 1 could have been selected 
as a row coordinate with considerable certainty, based on the fact that, 
since the dincme 12 was the highe s t - fr e queue y element in the bi literal 
distribution, it may be assumed that at least a number of 12's were causal 
and therefore 1 must be a row coordinate. In other words, the correct 
set of coordinates might have been established at the very beginning of 
the analysis, but for pedagogical reasons it was felt necessary to pro- 
ceed along the general lines of the solution as given. It is to be noted 
that, since at the start of solution we did not know exactly how many 
numbered row coordinates there were in this particular case, we could not 
apply the ratio of monomes to dinomes at once as the deciding criterion* 

k. If mixed- length systems were encountered in actual practice, 
after”" the type of matrix became known through solution of several days * 
traffic, solution of subsequent days' messages would be facilitated be- 
cause by this time the analyst would be familiar with the general type 
of matrix used. This knowledge would be of great assistance in making 
assumptions as to the nature of subsequent matrices. In some cases, the 
internal arrangement of the matrix might remain fixed, with only the co- 
ordinates being changed periodically, in other cases, the internal ar- 
rangement and the coordinates of the matrix might change, with only the 
size of the matrix remaining fixed. If it were known, for instance, that 
We enemy were using a monome -dinome system with only two numbered row co- 
ordinates, then there would only be 10 x 9 or 4-5 exhaustive trials (if 

1x2 

these had to be made) which would be necessary to guarantee reaching the 
correct decomposition of the cipher text; if there were three numbered 
coordinates, then there would be a maximum of 10 x 9 x 8 or 120 trials 

1x2x3 

necessary to insure reaching the proper scheme for the decomposition of 
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the cipher text .10 such trials, although laborious (and ordinarily un- 
necessary) when made by manual methods, would be by no means prohibitive 
if there were available machine processes for assistance. Exhaustive 
trials would rarely be necessary, except in very difficult cases; in the 
majority of instances, straightforward methods of cryptanalysis would 
reduce the large number of theoretical trials to but a few, from which 
the correct selection could be made. 

l. If the exact composition of the internal arrangement of the 
matrix were known, this knowledge would be useful in determining how the 
letters of assumed cribs would be enciphered as ironoiros or dlnoxes. In 
any cose, if a word of pronounced idiomorphic pattern is assumed, no mat- 
ter bow the letters of the word are encrypted as monomes or dinomes, the 
idiomorphism must be patent in the cipher text; for example, the word 
ARTILLERY in a rconome-dinome system must have a consecutively repeated 
monome or dinome representing Lp, closely flanked on both sides by some 
particular monorne or dinoire representing Rp. If uneneiphered numbers 
were to appear in the encrypted text, bracketed by an indicator to signal 
that numbers begin and end, the recognition of these plaintext numbers 
would enable the analyst to identify the Indicator, and thus, lead to 
the establishment of one row coordinate. 

m. It must be pointed out that mixed-length systems, even more so 
than other types of systems treated in this text, often present unusual 
problems for the cryptanalyst. Each esse is a distinctly special case,^- 
but continued practice in the solution of these types of systems should, 
as in other situations, cultivate skill and develop abilities in this 
field. 

n. The student may have noted that no mention has been made con- 
cerning the possible use of the <j> tost as a means for determining whether 
or not a particular trial decomposition represents b! e proper reduction 
of a cryptogram to monoalpliabetic ter^s. The <j> test has been ignored 
throughout this Section because, when dealing with cipher alphabets which 
include plaintext elements other than single lcttois (e.g., such elements 
as syllables, numbers, indicators, etc.), the value of <fp can only be 
loosely approximated, further] »re, computation of the valve of <j> r in a 
mixed-length cipher is also a rather tenuous matter. For this reason, 

it has been considered best to describe only methods of solution which 
do not depend at all on the use of the i test, and thus keep from estab- 
lishing in the mind of the student any doubt as to the usefulness of this 
test when applied in other instances, such as those described in earlier 
sections of this text. 



10 The number of combinations of H things taken p at a time is given 
by the formula jjC r - Hi ; thus Tor the assumption of 3 numbered 

rl(H-r)T 

rows In a monome-dinome matrix, mCo = 10*9 876 5*4 3*2*1 • 10*9*8 s 120 . 

3*2*1 (7 6*5 4-3-2-1) 3*2*1 

The notation Hi is read as "factojial H.* 

^ And, as one cryptowag has pointed out, some cases are more special 
than others. 
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79. Further remarks on cryptosystems employing Irregular -length 
ciphertext units * — -a. The subject of the diagnosis or identification of 
'mixed-length cipher systems has not been discussed. This problem can 
sometimes be extremely difficult in complex cases; however, the general 
statement can be made that one takes advantage of any phenomena of repe- 
titions that are present in a cryptogram to arrive at the conclusion that 
a mixed-length system has been encountered. If the repetitions present 
are separated by numbers of letters without a constant factor, or if the 
interval between repetitions is a prime number, and if the possibility 
of a null or nulls (of a different size than the real cryptographic units) 
has been considered and ruled out, then in all probability the cryptogram 
involves some sort of mixed-length cipher units , As to exactly which 
kind of mixed-length system is involved, this question can be answered 
only by detailed analysis, sometimes to the point of actual plaintext 
recoveries in order to he certain about one's conclusions. 12 

b. It is not imperative that a mixed-length cipher system be pro- 
duced through the medium of a matrix with row and column coordinates. 

For example, in one cryptogram that was submitted for solution, the cipher 
text began as follows: 



Q K T 2 Q 
K K T 2 K 



3 K B 3 K 
K T 2 K B 



QKTQK 
3 Q K T Q 



T 3 Q K T 
B Q R K 3 



2 K B 3 Q 

KQ2QK 



KTQE2 



T 2 Q R 2.... 



The entire cryptogram, containing 490 characters, consisted only of the 
seven symbols B, K, Q, R, T, 2, and 3 • When this cryptogram was solved, 
the following alphabet was recovered: 



A «K3 
B KR3 
C « QB3 
D = KB2 
E »KB3 
F = KKT2 



G = KR2 

H = Q3 
IJ = QKT3 
KQ = K2 
L = KKT3 
M = QR3 



N » Q2 
0 » QR2 
P = QR 
R = QKT 
S = QB 
T = QKT2 



U = Q 

V = QB2 
W = K 

X = KB 

Y = KKT 
Z = KR 



To the reader who is a devotee of the royal game, it will be apparent 
that the foregoing alphabet is based upon chess notation.^ If however 
the digits 1-7 had been used in lieu of the symbols above, the cryptogram 
could still have been correctly divided into its component ciphertext 
groupings of 1 , 2, 3, and 4 digits, based upon an interpretation of the 
characteristics present in the cipher text, and of the phenomena in a 
triliteral distribution showing one prefix and one suffix. 1 ^ 



12 Cf. the discussion of diagnosis in subpar. 69f. 

1 3 The chess -playing reader nngnt be interested m recovering the key 
word for this alphabet. 

The interested student could make up a cryptogram using seven 
characters in this fashion, so n* 1 coold see for himself the methods of 
attack on such a system* 
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c » The concept of irregular-length cryptographic units can he 
applied to many varieties of systems, "both code and cipher* For example, 
in Fig. 89, "below, there is illustrated a four-square matrix in -which 
plaintext digraphs are represented hy ciphertext dinomes, trinomes, or 
tetranomes. The positioning of the monome3 in the ciphertext portions of 
the matrix was governed hy the frequencies of individual components of 
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Figure 89 

four-square cipher digraphs, ^5 thus permitting opti™™ compression of 
the cipher text, i.e*, allowing the most liberal use of ciphertext dinomes 
and trinomes rather than the maximum cipher length of tetranomes; for 
example, the word REGIMEKEAl would he encrypted RE GI ME HE AL. 

76 8l4 06 68 1018 

4. The matrix for another mixed-length cipher system, employing 
dinomes and trinomes for the encryption of plaintext digraphs, is shown 



15 



See Appendix 2, Table 13, 



"Four-square individual frequencies 
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in Figs. 90a and b. Using this matrix, the word DIVISION is encrypted 
as 07 883 32 746.*” It is noted that consonant-vowel digraphs involving 
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eight high-frequency consonants with five vowels are represented by di- 
nomes , and all other plaintext digraphs are represented by trinomes. In 
those rare cases where, as in the example MU ZZ LB, an " impossible n di- 
graph appears in the plain text, the insertion of the letter Kp in the 
plain text at that point m question, similar to the normal Playfair doub- 
let convention, enables the encryption of the word, as MU ZK ZL E. A 
better variation of the foregoing system might incorporate a dinome matrix 
for the 40 highest-frequency digraphs (comprising 42$ of English plain 
text) such as that illustrated in Fig. 91, and a trinome matrix modified 
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in suitable fashion for the remaining digraphs. Such a scheme would 
yield a greater condensing property for the cipher text, hut would not 
he as easy to u se as the system described above since the easy mnemonic 
feature of the matrix in Fig. 90h would he lost. 

e. Another idea for a cryptosystem having irregular-length cipher- 
text groupings employs the diagram in Figs. 92s and h. This scheme in- 
corporates Playfair digraphic encipherment (with biliteral cipher equiva- 
• lents) and monographic encipherment (with un3 literal cipher equivalents). 
In order to disturb the regularity of usual digraphic encipherment (pro- 
duced by the Playfair -type matrix 3n Fig. 92a), certain selected me&ium- 



B G M V W 
W V M G B 



Figure 92a Figure 92b 

frequency consonants are enciphered monographically and uni literally by 
the reciprocal alphabet shown in Fig. 92b. Using Fig. 92, as an example, 
the phrase BRIGADE OF EHEMY IEFAMRY MOV! LG . . .woul d be broken up and 
enciphered as follows : 

BRIGADEOFEKEMYIMFMTRYXMOXVING 
¥ PL V CE AR AF IF M UL SN FH YO ZY M QT G EH V 

The cipher text, regrouped into fives, UPLVC EARAF LFMUL SEFHY OZYMQ TGKHV, 
reveals no indication of the uniliteral -biliteral encipherment involved. 
Since the letters BGMVW represent 8.2$ of normal plain text, there is ap- 
proximately 8$ interruption of the regularity of normal digraphic text. 
Furthermore, since it is expected that about half the time these letters 
will occur as singles in the plain text, and about half the time as In- 
terruptor letter (such as Xp in the example above) will have to he used, 
this scheme is accomplished by adding only about 4$ to the length of the 
original plain text. Other variations of the basic idea are found m 
Figs. 93 and 94; In Fig. 93, the Playfair matrix is a 6 x 4 rectangle 
omitting S and Y, and these two letters form a reciprocal monographic 
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encipherment convention; In Fig. 94, the Playfair matrix is the normal 
5x5, "but with the convention that, unless Ep is the second member of a 
digraph in the process of encryption, Ep Is represented monographically 
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"by J c . In the foregoing two figures, the ST of Fig, 93 could he replaced 
of course hy any other two letters whose combined frequency is in the 
neighborhood of 6 - 10 $, and the monographic E p of Fig, 94 could be replaced 
by any other high" or medium-frequency letter . Instead of Playfair 
matrices, the digraphic portions of the enciphering schemes of this sub- 
paragraph could be accomplished by the use of any other small -matrix 
digraphic methods. 



f. The Morse code, consisting as it does of irregular-length units 
composed of dots and dashes, lends itself to interesting cryptographic 
treatments. For example, the dots and dashes (and, of necessity, the 
spaces between Morse characters) might be encrypted by means of the table 
illustrated in Fig. 95, wherein each of the three elements has approxi- 
mately the same number of variants. A better idea, however, is to employ 
variants in the proportions of dots (42.4$), dashes ( 29 . 1 $), and spaces 



dot: ABCDEFGEI 

dash: JKLMNOPQR 

space: STUVWXYZ 

Figure 95 



dot: HYDRAULICBE 

dash: FGJKMNOP 

space. Q S T V W X Z 

Figure 9 6 



(28.4$) of the letters comprising normal English plain textj such a scheme 
for variants is shown in Fig. 96 . Thus, using the example of Fig. 96 , 
the word ENEMY (which in Morse code is • -• — -• — ) might be en- 

crypted as ES MDW CQ NFV P3KGZ, which would then be regrouped in fives 
for transmission. Other ideas for the encryption in digit form of Morse 
code systems might incorporate alphabets such as those illustrated in 
Figs. 97 and 98 "below: 1° 



dot: 12 3 4 

dash: 5^7 

space: 890 

Figure 97 



dot: 13579 

dash: 2 4 6 8 

space: 0 

Figure 98 



g. Space does not permit detailed examples of analysis of some of 
the foregoing systems. Admittedly, some of them would pose considerable 
difficulty in the way of solution, however, if these systems were used 
in actual practice, then operational cryptanalytic methods and entries 
would make possible successful solution. 

1 Further ideas of cryptosystems based on the Morse code will be 
treated in Military Cryptanalysis, Part IV. 
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SECTION XI 

"* ” _ "* a. 4H 

IDESCELLAIffiOUS KOKOAIPHABETIC SYSTEMS, COFCLUDIIIO miAIffiS 

~ ' *“ “ ■ ‘ Paragraph. 

Cryptosystems employing cyllpbcuy squares and code charts 80 

Cryptosystems employing characters other than letters or figures. ... 8l 
Special remarks concerning the initial classification of cryptograms.. 82 



Disgui red secret comniunicat Ions 83 

Concluding remarks 84 



P o. C ry ptosystems employing syllabary squares and code charts . - - 

a. Tne various cryptoSyr terns treated in the piecoding sections of 

this text have m the main fallen Into either the multillteral category 
or the polygraphic category This and the next fev subparagraphs will 
xi cat of systems which represent a merger of these two categories — 
maelYy biliterel systems which have as plaintext elements not only 
single letters end digits, but also certain polygraphs selected for the 
condensation in ciphci text that their usage may permit In addition, 
treatment will be made of bllitcral systems whicn Involve, as plaintext 
uni 1 / 3 , a selection of frequent words (that Is, which occur frequently 
An the typo of traffic for xnlrh the particular cryptosystem Is Intended) 
and perhaps some common phrases, such as "reference your message number", 
"request acknowledgment" , "nothing to report", etc. Systems which embrace 
digiapha, trigraphs and other polygraphs as plaintext elements in addition 
to single letters and digits ore called syllabary systems because the ad- 
ditional Inclusion of these polygraphs permits the encryption of plain 
text in a syllabic or quasi-syllabic fashion, most systems of this type 
involve bipartite matrices In the cryptographic scheme, and these matrices 
are called syllabary squares . When the matrix in this general typo of 
system also incorpoi ates words among the plaintext elements, the matrix is 
termed a code chart . . " 

\ " 

b. The category of systems embodying syllabary squares and code 
charts’ as the cryptographic vehicle actually constitutes a transition 
between cipher and code systems, 1 since a syllabary square or a code 
cnaru may be regarded equally properly as either a special type Of ci- 
pher or a primitive code. However, because syllabary systems folDov 
very closely on the ideas of bipartite matrices, these systems are in- 
cluded in this particular text Instead of being reserved for treatment 
In a subsequent text. 



1 See definitions of the terms cipher and code in the glossary. 
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A sample syllabary square Is illustrated in Fig 99, below 

123 lt 567890 
lpA I AL AN AMD AR ABB AS AT ATE 

2 ATX B 2 BE C 3 CA CE CO COM 

3D !)• DA DE E 5 EA ED EH ENT 

It EB ERE EES ES ESI F 6 G 7 H 

5 8 HAS HE I 9 IN IHG ION IS IT 

6 IVE J 0 K L LALEM MEN 

7 in) HE NT 0 OF ON OR OU P Q 

8 R RA RE RED RES RI RO S SE SH 

9 ST STO T TE TED TER TH THE THI THR 

0 

Figure 99* 




It will be noted that the square contains the 26 letters, the 10 digits, 
and Gv digraphs and trigraphs chosen both on the basis of frequency con- 
siderations and the combinative potentialities of the particular poly- 
graphs; the internal arrangement of the square xs such as to permit the 
easy finding of the plaintext elements to be enciphered. Other matrices, 
of larger dimensions, may contain not only a larger number of different 
plaintext elements within the matrix, but may also duplicate some of the 
more frequent plaintext elements and thus incorporate plaintext variants 
within the matrix. Furthermore, when letters are used as coordinates, 
variant cipher equivalents may be incorporated into the scheme 
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has been used as a standard tactical cryptosystem for ground forees by 
AGCRESSOR, the maneuver enemy 5 n U.S. joint maneuvers and train! ng ex- 
ercises. This chart provides 2-letter equivalents for letters, numbers, 
syllables, and a selection of "words "which occur frequently in low-echelon 2 
messages. A particular plaintext value may be designated by a combination 
of one of the two row coordinates and one of the two column coordinates 
of the cell containing the plaintext value; thus each plaintext element 
has four variant equivalents and, for example, the word ARTILLERY con- 
tained in the chart may be encrypted in toto as TE, TI, QJT, or QI 
When a complete word contained in the chart is to be encrypted in a mes- 
sage, no designator is necessary to indicate this low ercase meaning How- 
ever, when upper-case meanings (i.e*., letters, number's, and syllables) 
ere to be encrypted, it is necessary first to encrypt the designator 
"Spell/ fig Begins", followed by the cipher equivalents of the particular 
upper-case meanings; when the spelling is completed, the designator "Spell/ 
fig Ends " is encrypted , to show the return to lower-case meanings . The 
coordinates of the chart, as used by AGGRESSOR, were random sequences and 
were changed daily, the inside of the chart remained unchanged 

e For the most part, the steps used in the recovery of plain text 
from messages involving syllabary squares differ from those used in the 
solution of previously-discussed multiliteral and polygraphic systems only 
in that a larger number of plaintext elements may have to be considered. 

The cryptanalyst must accordingly modify his interpretation of the fre- 
quency characteristics and idiomorphic patterns occurring in such messages. 
By a careful study of the behavior of frequently recurring cipher units, 
the analyst is led to conclude that certain units, because of the general 
characteristics they exhibit, must be representative of numbers, others of 
punctuation, others of single letters, and so on. This classification is 
based upon a knowledge of the general behavior of the various classes of 
plaintext elements For example, expher units representing digits may be 
expected to appear in clusters (as in dates and time, and the designations 
of topographical features, such as hills, road junctions, etc.); whereas 
those which represent punctuation may be expected to appear at varying 
intervals throughout the message text (the particular intervals being de- 
pendent upon the particular punctuation mark) When this classification 
has proceeded upon a solid foundation far enough, each set of cipher units 
is underlined throughout the text in some distinctive manner by means of 
colored pencils. Subsequent to this, the individual members of each class 
of cipher units are subjected to closer scrutiny, and based upon a know- 
ledge of the specific behavior of the various elements in each class, 
specific units are identified as having specific plaintext meanings. For 
example, among those cipher units which the analyst has decided constitute 
tne class which represents plaintext digits, the particular cipher unit 



2 

The term low-echelon as applied to a cryptographic system means that 
the system is designed for use at the lower organizational levels such as 
(in the army) at the regimental level and below. The term low-grade as 
applied to cryptosystems means that the inherent security afforded by the 
system is low. Cf < the terms medium-echelon and -grade , and high-echelon 
and -grade . ™ ™ 



CONFIDENTIAL 



251 



CONFIDENTIAL 



REF ID:A56892 



representing plaintext "0" may be expected to bo readily recognizable on 
the basis that (l) it is one of the thiec units which appear as the first 
unit in those clusters which arc suspected of repieoenting four-digit 
time designations on^ (2) it is one of the two cipher units which, with 
any noteworthy frequency, occur doubled at the end of the some four -unit 
c Listers . 

f When working on messages involving code ehai to , tho cryptanalyst 
usually starts by attempting to isolate sequences of cipher units which 
represent plaintext letters., syllables, numbers and punctuation Subse- 
quent to this he proceeds to classify and identify those particular cipher 
units in the manner described in the foregoing subparagraph; the recovery 
of word meanings is usually accomplished much later. The isolating of the 
ciphertext units which represent syllabary portions may be readily accom- 
plished in those cases wherein the underlying code chert ha3 only one 
"Spell/fig . Begins" group and one "Spell/fig Ends" group, since the 
recognition of these designators automatically permits one to divide the 
ciphei text into word values and non-word values, the recognition of these 
designators is made on the basis of their high frequency and their alter- 
nating placements throughout the cipher text 

g . Ac plaintext meanings are recovered in a syllabary square system 
or code chart system, those meanings should be entered into a skeleton 
matrix in a manner similar to that used in the solution of the bipartite 
systems previously described (Sections VII and VIII) . This is done in 
order to uncover and exploit as early as possible any evidences of syste- 
matic construction arising from the anangoment which was used in tho under- 
lying matrix. It may be assumed that each syllabaiy square and code chart 
will normally have had Its internal elements arranged in some type of sys- 
tematic fashion in order to permit the ready finding of plaintext elements 
during the encryption of a massage 

h. When there are special circumstances involved, for instance, when 
the contents or the exact internal construction of the matiix is knotm, or 
when the arrangement of the outside coordinates is knenm, or when messages 
with isologous syllabary portions (i.e., spellcd-out portions encrypted 
"off-the-cut", such as IN TER CE P TO R and I KT ER CE P T OR) arc present 
in the ciphor text, solution is naturally considerably facilitated. Even 
when only a single message is available, if the matrix is known there may 
be special appioaches to solution, based on the nature of the plaintext 
elements constituting each row and each column of the particular matrix. 

For instance. If the words KEPERTECE and YOUR and MESSAGE are known to he 
in the same row of a particular code chart, then it would be quite possible 
that the ciphertext sequence LA LH LT at the beginning of a message repre- 
sents the stereotype RETEEEITCE YOUR j. LESS AGE, if but a few other similarly 
identifiable sequences weie also available to the cryptanalyst, he could 
possibly recover the ariangement of the outside cooidinates after a relative]/ 
few steps . „ , 

G l Cryptosystems employing characters other than lettcis or figuiea _ — 

a In practical cryptography today, the use of chuecxeis other than 
the letters of bona fide alphabets (including recognized Norse and Baudot 
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alphabets) or the 10 digits is comparatively rare. Whan so-called symbol 
ciphers, that is, ciphers employing peculiar symbols, signs of punctuation, 
diacritical marks, figures of "dancing men”, and so on sjlc encountered in 
practical work nowadays, they are almost certain to he simple monoalphahetlc 
ciphers. They are adequately described in romantic tales, 3 in popular hooks 
on cryptography, and in the moro conimon types of magazine articles. No 
further space need he given ciphers of this type in this text, not only be- 
cause of their simplicity hut also because they are encountered in military 
cryptography only in sporadic instances, principally in censorship activities! 
Even in the latter cases, it is usually found that such ciphers are employed 
in "intimate” correspondence for the exchange of sentiments that appear 
less decorous when set forth in plain language . They are very seldom used 
by authentic enemy agents Wlien such a cipher is encountered nowadays it 
may practically always he regarded as the work of the veriest tyro, when it 
is not that of a crank or a mentally-deranged person. 

h. The usual preliminary procedure in handling ouch coses, where the 
symbols may he somownat confusing to the mind because of their unfamiliai 
appearance to the eye, is to substitute letters for them consistently 
throughout the message and then treat the resulting text in the manner in 
which an ordinary cryptogram composed of letters Is treated. This procedure 
also facilitates the construction of the necessary frequency distributions, 
which would be tedious to construct by using symbols . 

£. A final word must be said_on the subject of symbol ciphers by way 
of caution When symbols are used to replace letters, syllables, and entire 
words, then the systems approach code methods in principle, and can become 
difficult of solution *■ The logical extension of the use of symbols in such 
a form of writing is the employment of azbii/raiy characters for a specially 
developed "shorthand" system bearing little 01 no resemblance to well-known 
and therefore nonsocret, systems of shorthand, such as GregG, Pitman, etc. 
Unless a cons idex able amount of tgxt is available foi analysis, a privately- 
devised shorthand may he very difficult to solve. Fortunately, such systems 
are rarely encountered in military cryptography. They fall under the heading 

3 The most famous: Edgar Allan’ Poe ♦ s ^ The ^Gold Bug j Sir Arthur Conan 
Doyle’s The Adventure of the Dancing Map ,' tmles Verne’s A Journey to the 
Center of the Earth . 

^ The use of oy&bols for abbreviation and speed in writing goes bock to 
the dayB of antiquity. Cicero's freedman and amanuensis. Tiro, is repoited 
to have drawn up "a hook like a dictionary, in which he placed before each 
word the notation (symbol) which should zepresent it, and so great was the 
number of notations and words that whatever could he written in Latin 
could he expressed in his notation ." The designation "Tlronian notes" is 
applied to this type of shorthand. 



f 



e ONFIDENTIAL 



253 




CONFIDENTIAL 



REF ID:A56892 



or cryptographic curiosities, of Interest to the cryptanalyst in his 
leisure moments .5 

82 Special remaxks concerning the initial classification of crypto - 
grams . — a The student should by this time have a good conception of the 

basic nature of monoalphabetic substitution and of the many vai iations 
■which may be played upon this simple tune The first step of all, natu- 
rally, is to be able to classify a cryptogram properly and place it in 
either the transposition or the substitution class . The tests for this 
classification have been given and as a rule the student will encounter 
no difficulty in this respect. 

b. There are, however, certain kinds of cryptograms whose c]aso 
cannot he determined in the usual manner, as outlined in par. 25 of this 
text. First of all there is the type of code message which employs bona 
fide dictionary words as code groups . Naturally, a frequency distribution 
of such a message will approximate that for normal plain text. The ap- 
pearance of the message, howevex , gives clear indications of what is in- 
volved The study of such cases will be taken up in its proper place. 

At the moment it is only necessary to point out that these arc code 
messages and not ci pher , and it is for this reason that in pars . 2 F and 
25 the words "cipher" and "cipher messages" are used, the word "crypto- 
gram" being used only where technically correct . 

c . Secondly, there corns the unusual and borderline cases, including 
cryptograms whose nature and type can not be ascertained from frequency 
distributions. Here, the cryptograms arc technically not ciphers but 
special forms of disguised secret writings which are raroly susceptible 
of being classed a3 transposition or substitution. These include a large 
share of the cases wherein the cryptographic messages are disguised and 
carried under an external, innocuous text which is innocent and seemingly 
without cryptogi aphic content— for instance, in a message wncrcin spe- 
cific letters are indicated in a way not open to suspicion under censor- 
ship, these letters being intended to constitute the letters of the 
cryptographic messages and the other letters constituting "dummies 
Obviously, no amount of frequency tabulations will avail a competent, 
expert cryptanalyst in demonstrating or disclosing the presence of a 

cx yptographic message, written and secreted within the "open" massage, 
whxch serves but as an envelop and disguise for its authentic or real 
Import . Certainly, such frequency tabulations can disclose the e cistence 
neither of substitution nor transposition m these cases, since both forms 
are absent. The next paragraph contains more about these latter coses .6 



15 

An example is found in the famous Pepys Diary, which was Witten m 
shorthand, purely for his own eyes by Samuel Pepys ( 1633-1703) "He wrote 
it in Shelton’s system of taehygraphy (l64l) , which he complicated by 
using foreign languages or by varieties of his own" invention whenever he 
had to record passages least fit to be seen by his servants, or by 'all 
the world . ' " 

^ The subparagraph which the student has just read (82c) contains a 
hidden cryptographic message. With the hints given in par. 83 let the 
student see if he can uncover it 
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83 . Disguised secret conrounigatlons —a As was mentioned" above, 
there i s a general class pi methods of secret writ ing in which a secret 
mess ago 10 concealed within the text of an apparently innocuous plain- 
text message, also, hy extension, a secret message may he concealed within 
otherwise horn fide media such as maps, drawings, charts, music manu- 
scripts, "bridge hands, chess .problems , shopping lists, stock quotations, 
and so on The addressee of such a communication, knowing where to look 
for the secret elements, does so and from them is able to read the message 
contained within its covering disguise . When the plaintext elements of 
the secret message are concealed by surrounding them with the plaintext 
elements of an Innocent cover toxt, such a system 13 known as a conceal - 
ment system When, however, the actual plaintext elements of tho secret 
message are not themselves concealed within a cover text, but instead 
have code equivalents which are themselves actual plaintext words or 
phrases and which are 'used to form an apparently innocent message, such 
systems are called open code systems . 

b. An example of o concealment system message is the communication 
"HAVE"" ESTABLISHED LOW PRIORITY", in which the secret message "help" has 
been concealed as the first letter of each word of the covering text As 
an example of an open code, in the message ’’AUNT MARY LEFT FOR DETROIT OH 
FRIDAY”, the words AUNT MARY might stand for "five troop ships", DETROIT 
might mean "Southampton", and FRIDAY might stand for "Monday." An often- 
cited case of open code is ttye message "A SON IS BORN", which allegedly 
was sent out by German-controlled radio stations all over the world in 
August, 191^, meaning that war was about to be declared 



£ . The solution of concealment systems may pose considerable diffi- 
culties for the cryptanalyst, who is placed in the rather odd situation 
where he might have before him a simple system, if he can hut find thd 
system. Most of the statistical and other tools at the disposal of the 
cryptanalyst are of no avail to him in the attack on concealment systems. 
First of all, he might not even know whether or not a given letter does 
contain a secret message, often the only reason for an examination of a 
particular message, other than a random sampling case, is that tho origi- 
nator or the addressee is on a suspect list and therefore the communication 
is considered for possible secret writing. The difficulty" in analysis 
is usually not brought about by the complexity of the system, for conceal- 
ment systems are almost always cryptographically simple. The dilficulty 
of the problem arises from the lack, at the outset, of tangible crypto- 
graphic elements into which the cryptanalyst can "get his teeth" There 
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is ptimorily wdc question of whether or not a secret text actually exists, ? 
and, if it does, where are the elements constituting the secret text As 
a consequence, locating the elements of the secret text and deriving the 
meaning of the secret text arc practically synonymous . Success in this 
type of analytic work requires extraordinary patience and perseverance, 
keen powers of observation nurtuied hy unrelenting suspicion, a lively 
imagination, exceptional ingenuity, and organised methods of analysis— 
plus a firm foundation and considerable experience in the methods and ” 
practices of concealment systems 

d The number of different concealment systems possible Is enormous . „ 
The letters of the secret message might he concealed as th^ first, second, 
or third letters of the cover text, or they might be concealed as the 'final 
penultimate, or antepenultimate letters of the words, or they might bo 
concealed by means of a specific key into preari anged variable placements 
within the words of the innocent text. The secret text might he read hy 
consider ing the letters which follow or precede all unnecessary breaks in 
cursive handwriting; or the secret text might he indicated hy shaded 
letters or hy pin pricks over significant letters, or even by elongated 
tails on words pointing to significant letters in the line above. In the 
analysis of such concealed-letter systems, it is advisable to write the 
successive words of the cover text one below the other, in a column, aligned 
hy their beginnings and subsequently to rewrite them columnwise aligned by 
their endings, this will assist in disclosing a secret text hidden in a 1 
fixed position relative to the beginnings or endJngs, or in diagonal routes 
near those locations (see Fig 30l) . It is also advisable to write out the 

“—"7 — — — 41 

In this connection, it is worthwhile to cite an extract from an official 

report prepared in 1946 hy the wartime Office of Censorship; 

"Detection of concealed messages is based on the principle that 
there is no absolutely safe disguise for duplicity. Espionage letters 
have weaknesses and identifying characteristics, which modern techniques 
can minimize hut never completely eliminate. Seasoned examiners develop 
an ability to relate facts and think clearly about possibilities They 
develop a keen perception of, or alertness to, certain peculiarities, 
an attitude of suspicion toward certain indicators, and experience or 
training in handling certain types of materials _ , 

_ j ,v. . - ~ > 6 l t 

"The texts of letters containing concealed messages do not ring 
true; they lack spontaneity, and the normal emphasis which people give 
to certain thoughts or ideas is absent Something comparable in social 
life is the stilted behavior and speech of a person who is obliged to 
entertain a stranger with whom he feels nothing in common, he behaves 
unnaturally, he desires to be polite, but in order to do so he must 3 
hide his boredom and pretend an interest he does not feel. Exactly the 
same is trie in the writing of cover texts or open code letters— the 
attempt to pursue two aims simultaneously results in strain. Skill 
and experience may overcome the strained- text hazard to a high degree, 
but they can never completely dispel the distortion and dislocation of 
a normal emphasis inevitable in a cover letter." 
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cover text in rectangular arrangements of varjous widths, In order to dis- 
close secret text which might have been concealed in every nth letter of 
the entire cover text (see Fig. 102). in cases whore physical indicators 
are employed, such as treats 'in handwriting or as shaded letters, an exami- 
nation of the letters in the immediate vicinity of such Indicators would 
disclose the secret text , 

Cover text: * * Cover text: 

# ^ — 

UNCLE EZRA SEEMS DESPONDENT. * WHEN YOU SEE CHESTER AT 

HAVE YOU HEARD THE LAST REPORT? - MADISON'S HOUSE TELL HIM 

LOIS DEPARTED. 

W H E N Y 0 
USEBCH 
E S T E R A 
T M A D I S 
OKSHOU 
SETELL 
H I M L 0 I 
SDEP AE 
TED" 

REPORT 



UNCLE 

EZRA 

SEEMS 

DESPONDENT 

HAVE 

■"YOU 

HEARD 

"“THE 

LAST 



Secret text: HEED HELP 



Secret text: NEED HELP 



Figure 101. Figure 102. 

e. Some systems involve the concealment of entire words, instead of 
just individual letters, in the cover text Thus, for example, the secret 
text might consist of (l) every nth word of the cover text, (2) the first 
and last words of every line, (37 words preceding or following punctuation 
marks, ( 4 ) words bisected by an imaginary line running diagonally from the 
upper left to the lower right of the sheet of paper; or countless varieties 
of similar schemes. Grilles have also been used, the secret text being 
witten through the apertures of the grille on placed positions on the 
sheet of paper, and then a covering letter written to surround and cam- 
ouflage the secret text. In the solution of concealed-word systems, ex- 
amining the text produced by counting off every nth word may bear fruit; 
if the secret text is long enough, the validity of the assumed secret text 
may be proved by the consistency of the decimation. In cases wherein a 
variable key has been used to indicate which words constitute the secret 
text, proof of the assumed secret text may be impossible, unless the key 
is short compared to the message lengths, or unless additional messages in 
exactly the same key are available for comparison to test an assumed key. 



f . There have been many cases in which a secret text has first been 
converted into the dots, dashes, and spaces of the Morse code, or encrypted 
in a Baconian or a tripartite cipher; then this converted text was concealed 
within an innocent text in any one of the almost infinite number of possible 
ways Some of these ways in which the raultiliteral elements of the pre- 
liminary conversion may be represented are by (l) the lengths of words. 
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(2) the number of vowels or consomnts in the words; (3) the number of 
syllables in the words; or (4) by the ways in which t's are crossed, or i's 
ore dotted . The solution of such systems involves experimentation with"* 
basic hypotheses concerning the monger in which multiliteral elements are 
denoted, followed by a recombination into monoalpbabotic terns (under the 
assumption of a Morse, tripartite, or Boeonian system) and solving the 
1 educed monoalphabctlc text Another method for 0 concealment system 
involves the use of a bipartite matrix employing coordinates consisting 
of vowels (or, for that matter, any othci set of five or six letters), 
the seexen text is first enciphered in this biliteral system, and then 
tne vowels arc surrounded by consonants to form the plain text of an 
innocent cover message. As in most concealment systems, once such a 
suot cifugc is suspected or assumed , then and only Ihen is solution 
pbisiole — — 

g The detailed discussion thus far has been limited to concealment 
systems 8 In cases of open code, unfortunately there are no clear-cut 
juathod3 of analysis or even of recognition, there is simply no rational 
way of proving that a message such as "AUET MARY LEFT FOR DETROIT TODAY” 
contains a secret meaning, unless it is known for a fact that the 3cnder 
naa no aunt named Mary, and even then there still might exist a friend 
of the sender's who is affectionately called "Aunt Mary"— or, for that 
matter, she might be someone else 's aunt 9 And once having suspected 
or even proved that there i3 something rotten in Denmark, proof of the 
content of the hidden meaning is simply out of the question unless the 
serder is somehow convinced to mend his ways and thereupon volunteers 
the information In many wartime instances where open codes have been 
used, a legal case could not be proved against a suspect without his 
cooperation. 

h. A prominent case of the U3e of open code in espionage communi- 
cations is that of an Axis spy, Mrs. Velvalee Dickinson, who in August, 
1|?44, was sentenced in New York to ten years ' imprisonment and was fined 
*10, COO after pleading guilty to the charge that a series of letters she 
hod written to an agent in Buenos Aires in the early part of 1942 con- 
tained secret messages hidden in the plain text. These messages gave 
information regarding the location and condition of allied warships in 
Pacific ports These two agents professed to be dealers in antique aolls 
and used a prearranged code giving secondary meaning to words pertaining 
to the sale of dolls Mrs . Dickinson would send out letters advertising 
01 offering to sell certain of her antique dolls to the addressee. She 
would write the doll's name and after the, name a brief description, then 
she would write, as in an ordinary business letter, the price of each 

. — — 

Further discussion of this subject will be found in Appendix 9, 
"Concealment Systems," 

9 In one instance, it has been related that a censor reviewed a tele- 
gram transmitted by a person on a suspect list The telegram read "FATHER 
IS DECEASED." The censor, smelling a rat, changed the text to read "FATHER 
IS DEAD", and waited. Sure enough, several hours later came a query: “IS 

FATHER DEAD OR DECEASED?" ’ 
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doll The originoD cause for* suspicion was the eytro®** variation In 
prices ovei a range of three or four letters of what was apparently the 
same doll ox the same type of doll A great ranny Ictteis irex" necessary 
in order to "build up a cape sufficient to prove the uae of open code. 

It is doubtful even then that the use of open code could have been legally 
proven except for’ the fact thht ; faced with so much evidence against her, 
she chose to confess this use. 1 

M * * l ’'*.<■4 r 

t 

J.. In addition to concealment systems and open codes, there ore 
three other methods for hiding the existence of secret text. These 
methods embrace the following^ 

* t » 

(1) secret inks; 

(2) microscopic writing, involving use of micropantographs j and 

(3) photographic methods, including "mierodots" (i.e., the 
reduction of a page of copy to a negative the size of a miniature 
dot, which is then affixed on a period or on the dot of an "i") , 
double printing, double exposure, or' concealment within photographs. 

i- , 

The methods of use and analysis of these systems, however, are beyond the 
scope of this text. ’ ~ ~ ' 

4 j. 

8U Concluding remarks — a The student will have by this time 

appreciated that monoalphabe t ic substitution ciphers are for the mogt 
part quite easy to solve, once the underlying principles are thoroughly 
understood. As in other arts, continued practice with many examples 
leads to facility and skill in solution, especially where the student 
concentrates his attention upon traffic all of the same general nature, 
so that the type of text which he is continually encountering becomes 
familiar to him and its peculiarities or characteristics of construc- 
tion give clues for short cuts to solution It is true that a know- 
ledge of the general phraseology of messages, the kind of words used, 
their sequences, and so on, is of very great assistance in practical 
work in all fields of cryptanalysis In operational cryptanalysis, it 
is of vital importance to gain a knowledge of the language habits of a 
particular group of correspondents, to permit the rapid exploitation of 
the cryptosystem involved. Thus, at" least initially, all possible traffic 
is cryptanalyzed, even tnat in simple systems and that of comparatively 
little intelligence value Word lists obtained empirically are of more 
value than "intuitive" or academic compilations, however, at the outBet, 
reference may of course be made to these latter compilations 1° 

b. Some of the simpler subterfuges which the student should be on 
the lookout for in monoalphabetic substitution are the following: 

I 

(l) There may be employed in the cryptographic scheme the con- 
secutive use of several different mixed cipher alphabets in a single 
long message Obviously, a single, composite frequency distribution for 
the whole message will not show the characteristic crest and trough appear- 
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See in this connection the word and idiomorph lists comprising Appendix 3< 



C ONFIDENTIAL 



259 




REF ID:A56892 



CO NFIDE I mA E" 

ance of a simple nvonoalpbabetlc cipher, since a given cipher unit will 
represent different plaintext letters in different parts of the message. 

But if the cryptanalyst will carefully observe the distribution as it is 
being compiled , he will note that at first it presents the characteristic 
crest and trough appearance of monoalphabetieity, and that after a time 
it begins to lose this appearance If possible he should be on the look- 
out for some peculiarity of grouping of letters which serves as an indi- 
cator for the shift from one eipher alphabet to the next. If he finds such 
an indicator he should begin a second distribution from that point on, and 
proceed until another shift is encountered. _By thus isolating the dif- 
ferent portions of the text, and restricting the frequency distributions 
to the separate monoalphabets , the problem may be treated then as an ordl- 
, nary simple monoalphabetic substitution. 11 Consideration of these remarks 
in connection with instances of this kind leads to the comment that it is 
often more advisable for the cryptanalyst to compile his own data, than 
to have the latter prepared by clerks, especially when studying a system 
ab Initio . For observations which will certainly escape an untrained 
clerk can be most useful and may indeed facilitate solution. For example, 
in the case under consideration, if a clerk should merely hand the com- 
pleted over-all uniliteral distribution to the cryptanalyst, the latter 
may be led astray, the appearance of the composite distribution might con- 
vince him that the cryptogram is much more complicated than it really is. 
While still on the subject of frequency distributions, it is pointed out 
that, although earlier (par 43) the triliteral frequency distribution 
was cited primarily for its usefulness in extracting frequency data relative 
to the digraphs and trigraphs occurring in a simple substitution cipher, 
this particular type of distribution is used extensively in the manual 
attack on many other types of cryptograms because it provides one of the 
best means for systematically locating all of the repetitions which appear 
in a message . 

(2) There have been cases where direct and reversed standard al- 
phabets have been used alternately in a single cryptogram, the change of 
alphabets being made at Irregular Intervals, or changed at the end of every 
word or with each group of five letters . If the interruption takes place 
at too short an Interval, not only will a frequency distribution be of no 
avail, but also it would be almost impossible for the cryptanalyst to de~ 
tei mine when and how the change of alphabets occurs from a mere examination 
of the cipher text However, if the cryptanalyst is on the alert to try tbe 
simplest tiling first , completing the plain-component sequence on the as- 
sumption of standard alphabets will yield a solution vhore otherwise a 
solution might be out of the question 

" £ 

(3) Another subterfuge that has been encountered is the encryption 
by means of a ironoalphnbetic uniliterai substitution of a message whose 



11 The cryptanalyst should be on the alert foi the possibility of re] nted 
alphabets in such a system, if thjs is ths cu3e, the reconstruction oi th rt 
piii">ry co “portents fiou uhe solution of one portion of the i -ssrga * ould 
enable the reading of oth^r ps: lions of tl^r ireosrge by r rn., c*£ ■’h gei *rn- 
trix i> rtl ol ti«. ted in p-i 50 
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plain t<?<t ha<3 first been -written bn /Re wards (or for tlint matter, on ordinary 
simple substitution cipher sent backwards ) Ciphers of this type may suc- 

cessfully resist the unsystematic attempts of solution which a tyro might 
wake, however, the experienced analyst would probably quickly recognize the 
went subterfuge if he wore to examine the frequencies of cipher digraphs, 
trigrapbs, and tetragraphs, in relation to the unilateral frequencies of 
their component letters , „ , 

c. KonoeOphabotlc substitution with variants represents on extension 
of the basic principle, with the intention of masking the characteristic 
frequencies resulting from + a strict monoalphabet icily, by means of which 
solutions a->*e rather reodily obtained Some of the subterfuges applied on 
tbo establishment of voriant^or multiple values are simple and more or less 
fax! to serve the purpose for which they are intended; others, on the con- 
liaiy, may interpose serious difficulties to a straightforward solution. 

But in no cose way the problem be considered of more than ordinary dlffi- 
- - ’ V-ivsjc it should be recognised that where these subterfuges 

-re v tally adequate to the purpose, the co’irplications introduced ore orv'b 
tnat the practical manipulation of the system becomes cs difficult fox- the 
cryptographer as for tae cryptanalyst 

(1) A few words nay be added here in regard to a method which 
c-fton suggests itself _to laymen, but which is very old indeed in the art. 
fhio eerstets in using a book possessed by all the correspondents and 
Otdicutlng the letters of the message by means of numbers referring to 
•'-CC3 + 1c letters in the book One way consists in selecting a certain 

page and then giving the line rvunboi and position of the letter in tno 
3 in-', the page number being shown by a single initial indicator Another 
wry is to vso the entire boolw. Giving the cipher ’’equivalents in groups 
of three numbers representing page, line, ond number of letter (for example, 
75-0-10 means page 75r 8th line, 10th letter in the line) Such systems 
are, however, extremely cumberooPi^ to use and, when the cnciphci'ing Is 
done carelessly, c*an be solved The basis for solution in such coses roots 
upon the use of adjacent letters on the same line, the accidental repe- 
titions of certain letters, end the occurrence oi unenciphcrcd worda in 
the messages , when laziness or fatigue intervenes in the enciphering 12 

(2) It may also ce indicated that human nature and the falli- 
bility of cipher clerics is such that it Is rather rare for an encipherer 

k. •» 

In 1915 chc German Govefnn\'»nt conspired with a group of Hindu revo- 
lutiOEnnes to sxir up a rebellion in India, the purpose being to cause 
th'* withdrawal of British troops from the Western Front Hindu conspira- 
tors In tne United S to tea 'were given money to purchase arms' and eanc&oai- 
tjon and to transport them to India* For comarunicat J on with their sup- 
eiioro in Berlin the conspirators used, among others, the aybten described 
in tnrs paragraph A 7~V°Z e typewritten letter buI3 t up Iron rage, line, 
and letter -number references to a book knoim only 10 the coiurauicants , 
vis intercepted by the Erixish and 'turned over 10 tno United Suite a Gov- 
07 muni foi use in connection with the prosecution of the Hindac for 
violating our peuti ald-jpy ’ , IThe author /^U.r.F JJ solved this massage with- 
out the boon in question, by taking full ndvante&e of the clues referred 
to " 
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to make full use of the complement of variants , placed at his disposal. 

The result is that In most cases certain of tjie equivalents will he used 
so much more often than others that diversities in frequencies will soon 
manifest themselves, affording important data for attack by the crypt- 
analyst t 

d There is one aspect of cryptography within the realm of mono- 
alphabetic substitution ciphers that should he discussed at this point — 
the aspect involving repetitive monoalphabet ic substitution. 

(1) Suppose a message undergoes a primary encipherment by means 
of a single mixed, non-reciprocal cipher alphabet, and this piimary ci- 
pher text then undergoes a secondary encipherment by means of the same or 
a different mixed alphabet The resulting cryptogram is still monoalpha- 
hetlc in character, and presents very little, if any, augmentation in the 
degree of security (depending upon the type of alphabet employed) .T3 Here 
an entirely illusory increase in security is involved and on ineffectual 
complexity is Introduced, the process may Indeed be repeated indefinitely 
without producing the desirable result of added security Similarly, the 
some illusory increase in security is present in the oase of repetitive 
multiliteral encipherments involving regular-length ciphertext units, as 
long as the repetitive encipherments aie made "on the cut" . 

(2) In the case of repetitive polygraphic encipherment made on 
the cut, a moderate increase in security is achieved over the degree of 
security normally provided hy a single polygraphic encipherment. For 
instance, in the case of repetitive digraphic encipherment using, let us 
say, a four-square system for the first encipherment and modified Play- 
fair system for the second step, the final encipherment is still monoalpha- 
betic digraphic in character, except that the* cryptosystem might have to 
he resolved as involving a more-or-less random square table, instead of 
being recovered in its primary and secondary steps, all the repetitive 
encipherment has accomplished is that it has added to the difficulty of 
reconstruction of the matrices used — hut this, in the case of a digraphic 
system, is a reasonably fair increase in security, since we expect solution 
to he expedited through an early recovery of the matrix 

(3) When, however, successive multiliteral or polygraphic en- 
elphernents are made "off the cut" for the second step, the increase in 
security can he considerable, since the end result no longer exhibits the 
phenomena of monoalphabeticity and the cryptanalytic complexity of the 



13 ’■ - ^ - 

The only possible slight increase in security lies in the fact that 

the key words for the primary and secondary encipherments might be made 

more difficult to recover or even impossible to recover 
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system has been thereby materially enhanced For example, using the 
two -square matrix illustrated in Fig 55 on page 162 , the massage 
KEEI'E* QRCEi EIITS HEEDFD undergoes the following encipherments* 

HE FN FO RC EM EH TS HE ED ED 

Cipher I IL DP UM CF KT DP GI UL DF DF 

V'VVVWWN/ 

Cipher II OC OT MC^ MR TD QF TO OC AH 

The first encipherment, IL DP UM. , is subjected to a second encipher- 
ment by considering tho digisphs "off the cub", resulting in the encrypt- 
ment OC 05? M3 In the final cryptogram, the first and last letters of 

the primary encipherment may be retained os is, or they may be combined 
for the second encryption, for added security, thus the final cryptogram 
may read either IOCOT OCABF, or ROCOT . OCAHG When this sort of 
secondary encipherment is applied in a repetitive multillteral cipher, the 
system is called a fractionating system 15 The cryptanalysis of these 
systems, which is often quite complex, will be treated in subsequent texts. 

e. If the cryptanalyst is fortunate enough to have a pair of icologs, 
one message of vnieh is in a nionoe lphabetic substitution system and the 
other in a transposition system, it may be possible for him to make exact 
identifications of the elements in the substitution cipher based on the 
plaintext letter frequencies present in the transposition cipher. Then, 
having the plain text, tne solution of the transposition is greatly facil- 
itated 



14 A rather ingenious idea proposed by Charles Eyraud in his excellent 
work, Precis de Cryptogrophie Hoderne , Paris, 1953, PP 224-225, involves 
a repetitive cncipneiment using two different monome-diuoire matrices. In 
Eyraud’s example, using tho two matrices illustrated, the plain text 



1 

2 



I23456789 

HFTaFtTr 

UDLFVQMPC 
HGO B XUZK 



Matrix I 




w 

Matrix II 



E CRIT UR ES SE C R ETES 

3 19 9 8 7 11 9 3 4 4 3 19 9 3 7 3 4" 

31 9 9 8 7 11 9 34 4 31 9 9 37 34 

IAAZUCAQBIAAFQ 

"ECRITURES SECRETES" is first enciphered with Matrix I, uhen the digits are 
recombined into letters using Matrix II vith the resulting cj.pn.er text 
IAAZU (It Is interesting to note that the 17 letters of the plain text 
are encrypted by only l4 letteis in the final cipher*) The letter Yp is 
eliminated from Matrix I, and is included m .Matrix II xo take care of a 
final 1 or 3 in tne fix st encipherment which otherwise could not have been 
encrypted as a single element. 

■*•5 See Appendix 7 for other examples 
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f As has already been stated in subpar 2c, mathematics and math- 
ematical methods have an important place in the art of cryptanalysis . This 
text has included only those introductory statistical and mathematical 
applications -which apply to monoalphabet ic systems If it appears to the 
student that there has been a rather extensive treatment of too-specialized 
techniques, let him be reassured that these have been included as being in 
the nature of collateral information, rather than being an absolute* necessi- 
ty in the solution of the particular problems to which they were applied 
As a final wold of eaution to the student the following extract from a re- 
poi t hy C . H. O'D. Alexander is included: 

"There is a considerable danger that a learner, when he realizes that 
statistical methods can he of some use, will attempt to use them where they 
are quite Inappropriate If he does this a few times and finds it gets him 
nowhere, he then gives the whole thing up as a waste of time and does not use 
such methods where he might There is also the worse danger of doing statis- 
tical test3 for their own sake so that they are used a3 a method of passing 
the time and avoiding real thought about the problem to he solved " 

g. The general problem of cryptanalytic diagnosis has been discussed 
briefly in various Sections of this text The problem is far from simple, 
since many variations and conventions may he encountered in the various 
systems treated in this text, furthermore, the problem is made even harder 
by the fact that certain systems, themselves quite simple, may he combined 




REF ID:A56892 



CONFIDENTIAL 

_ - i 

to produce a system much more difficult to diagnose The lack of precise 
diagnostic tests, such as those available in the natural sciences,!*^ is 
brought about by the fact that variations and conventions introduced into 
otherwise conventional systems may change radically the appearance and 
manifestations expected in the cipher text produced by the known systems, 
yielding "hitherto -unencountered phenomena." Each cryptosystem is then 
actually an individual and unique case in diagnosis If 

(l) For example, four-letter cipher groups of the pattern con- 
sonant-vowel-consonant-consonant jdo not necessarily prove a code system, 
even though this grouping is a frequent one in four-letter code systems, 
the basic system might still be a cipher system, with the apparent 

The author feels that it is of value to pursue further a discussion 
of how the science of cryptanalytics compares with some branch of one of 
the natural sciences, when the diagnostic procedures involved in each are 
considered. In that branch of biology called taxonomic botany, for example, 
the first steps in the class if icatory process are based upon observation of 
externally quite marked diffeiences; as the process continues, the observa- 
tional details become finer and finer, involving more and more difficulties 
as the work progresses Towards the end of the work the botanical tax- 
onimist may have to dissect the specimen and study internal characteristics . 
The whole process is largely a matter of painstaking, accurate observation 
of data and drawing proper conclusions therefrom Except for the fact that 
the botanical taxonomist depends almost entirely upon ocular observation of 
characteristics while the cryptanalyst in addition to observation must use 
some statistics, the steps taken by the former are quite similar to those 
taken by the latter It is only at the very end of the work that a sig- 
nificant dissimilarity between the two sciences arises. If the botanist 
makes a mistake in observation or deduction, he merely fails to identify 
the specimen correctly, he has an "answer" — but the answer is wrong He 
may not be cognizant of the error, however, other more skillful botanists 
will find him out But if the cryptanalyst makes a mistake in observation 
or deduction, he fails to get any "answer" at all, he needs nobody to tell 
him he has failed. Further, there is one additional important point of 
difference The botanist is studying a bit of Nature — and she does not 
consciously Interpose obstacles, pitfalls, and dissimulations in the path 
of those trying to solve her mysteries. The cryptanalyst, on the other 
hand, is studying a piece of writing prepared with the express purpose of 
preventing its being read by “any "persons for whom it is not intended. The 
obstacles, pitfalls, and dissimulations are here consciously interposed by 
the one who encrypted the message. These, of course, are what make crypt- 
analytics different and difficult. 

^ Baudouin ( op c it . , ''Chapter XIV ) drew up a sort of check list of the 
class if icatory procedures r which an ajialyst might follow when attempting to 
diagnose the cryptosystem' underlying" a“ particular cryptogram or cryptograms. 
However, the science of cryptanalytics being as it is does not lend itself 
to successful completion of such diagnostic "check lists." Thus, the one 
compiled by Baudouin is far from satisfactory and is of no more than aca- 
demic interest to the present-day practicing cryptanalyst 
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characteristics of a code system. Upon closer examination, it might he 
possible to disprove a code system, based on the non-appearance of certain 
other characteristics that should be present in a code system 

(2) If a cryptogram or a set of cryptograms contain only the 
letters A through 0 in the cipher text, all that can be said Initially 
is that only 15 letters are present in the encrypted text, and that 
the system must be one of substitution, either cipher or code. If a 
cipher, then the system must of course be a multiliteral system (including 
perhaps a mixed- length system), not excluding, for example, a digraphic 
system or a cod- chart For instance, in the hiliteral matrix below, 
the cipheitext units consist only of pairs of consonants, and the plain- 
text elements include the 2.6 letteis and the 37^ most frequent digraphs; 
thus the system is essentially a digraphic system Such a system would 
not be at once recognized as a digraphic system, and if the vowels were 
used as nulls, the diagnosis of the cryptosystem would be considerably 
impeded 

h. The often extensive and elaborate treatment of the many varieties 
of cryptosystems within the scope of this text has not been given solely 
for the sake of the analysis of the particular systems involved, but rather 
to illustrate the general cryptanalytic techniques which are applied to 
various problems . In being guided along the lines of "thinking cryptanaly- 
tically", the student has been put in a position to analyze successfully 
many possible variations and modifications of the cryptosystems treated in 
this text and in the accompanying course . The cryptosystems in this text and 
accompanying course have been solved for the most part from one or two 
messages Naturally, there is a certain amount of artificiality in the 

BC DFGHJKLMNPQRSTVWXZ 
B I A AA AB AC AD AE AF AG AH AI AK AL AM AN AO AP AR AS AT AU 

C AV AW AY & BA BE BI BL BO BR BT BU BY C CA CC CE CH Cl CK 

D CL CO CR CT CU CY P DA DB DC DD DE DF DC DH DI DL DM DN DO 

F DP DQ DR DS DT DU DV DW DY E EA EB EC ED EE EF EG EH El EJ 

G EL EM EN EO EP EQ ER ES ET EU EV EW EX EY EZ F FA FC FE FF 

H FI FL FO FR FS FT FU FY Gr GA GC GE GF GG GH GI GL GN GO GP 

J GR GS GT GU GW H HA KB HC HD HE HF HI HL BM HN HO HR HS HT 

K HU HY l IA IB IC ID IE IF IG DC IL IM IN 10 IP IR IS XT IV 

L IX IZ JJAJEJOJU KKAKEKIKS U LALBLCLD1BLFLG 
M LI LL LM LN LO LP LR LS LT LU LV LW LI M MA MB IC ME MI MM 

N MO MP MR MS MT MU MY N! NA NB NC HD HE NF NG NH NI NEC IJL NM 

P HN NO NP NR NS NT NU NV NW NY O OA OB OC OD OE OF OG OH 01 

Q OK OL OM ON 00 OP OR OS OT OU OY OW OX OY P PA PE PF PH PI 

R PL FM PN PO PP PR PS PT PU PY Q QU R RA RB EC RD RE RF RG 

S RH RI RL RM BN RO RP RR RS RT EU KV KW RY S SA SB SC SD SE ' 

T SF SG SH SI SK SL SM SIT SO SP r SR SS ST SU SW SY T TA? TB TC 

V TD TE TF TG TH TI TL TM TN TO TP TR TS TT TU J? W TY TZ U UA 

W UB UC UD UE UG UI UL UM UN ’UP UR US UT V VA VE VI 'VO W WA 

X WE WH WI WL WN WO WR WY X XA XC XEJCF XI £N XP XT Y YA YB 
Z YC YD YE YF YG YH YI YL YM YN YO YP YR YS YT YW H. ZA ZD ZI 
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example a and messages c nployed herein The text? of messages have been 
manipulated , especially in connection with the accompanying problems, 
ill order to illustrate^ pedagogical principles and the application of 
cryplanatytic techniques In actual practice, instead of the one or two 
messages , five might be required, or for that matter, fifty or more might 
bo xequired in ’ox der^ to effect a solution In operational practice, there 
is frequently a" high incidence of garbles which would have a pronounced 
impact on hot only "a "facile Identification of the crypi osystem but also 
on its subsequent solution Speed is an essential criterion in operational 
practice, a cryptosystem must be broken and messages reod as soon as 
possible, to bo of maximum use to a field commander— messages read six 
or twelve months after they were sent are hardly of more than historical 
iBiportance Nevertheless, when a system is cryptanalyzed for the first 
time, no matter when it is broken it helps maintain cryptologic continuity 
which is of extreme importance in successful operational practice. 

i The student should now study, if he has not already done so, the 
various appendices to this text Through them, he may gain an insight into 
further aspects of cryptography and topics related to the art of cryptana- 
lysis Practice on many different ciphers of the types covered in this 
text will tend to sharpen the wits and give to the student confidence and 
facility in the cryptanalysis of unknown examples It 3s for this reason 
that a course of problems is a necessary adjunct to the study of this 
text ; as was previously mentioned, one month's actual practice in solution 
is worth a whole year's mere reading of theoretical principles 

j . It may be of assistance to indicate, by means of a graphic out- 
line, the relationship existing among the various cryptographic systems 
thus far considered The outline will be augmented with each succeeding 
text as the different cryptosystems are encountered, and will constitute 
what has already been alluded to in par 6d and there termed a "synoptic 
chart of cryptography" The synoptic chart for this text (Chart 9) forms 
an insert following this Section. Looking at this chart the student may see 
that, although it is essentially dichotomous in form, at several levels 
there appears a sort of cryptographic tertium quid — some category (or cate- 
gories) of cryptosystems which properly belongs at the particular level 
shown, but which does not directly fit into either of the two primary sub- 
divisions already appearing at that level However, if the student will 
study the synoptic chart attentively, it will assist him in fixing in 
mind the manner xn which the various systems covered thus far are related 
to one another, and this will be of benefit in clearing away some of the 
mental fog or haziness from which he is at first apt to suffer. 



k. There remain five more volumes to this series of basic texts on 
the art of cryptanalysis Military Cryptanalytics, Part II, will treat 
mainly periodic polyalphabetic substitution ciphers, including periodic 
numerical systems. Part III will treat varieties of aperiodic substitution 
systems, including an introduction to elementary cipher devices and crypto- 
mechanisms, Part IV will treat transposition and fractionating systems, and 
combined substitution-transposition systems. Part V will treat the recon- 
struction of codes, and the solution of enciphered code systems, and Part VI 
will treat the solution of representative machine cipher systems . In addition, 
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throughout the five remaining texts there will he interpolated statistical 
techniques applicable to the systems treated, and information on the appli- 
cation of analytical machines in cryptanalytic problems. The security 
classification of each succeeding text will vary according to the information 
contained therein. It is not intended that the student study all six texts; 
life is too short to become an expert cryptanalyst in all fields of the art. 
Parts I and II embrace most of the necessary fundamentals of cryptanalysis; 
the succeeding four volumes will impart knowledge on more specific categories 
of systems with which the cryptanalyst may be faced. r 
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Chart 7. Synoptic chart of cryptography for Military Cryptanalytics, Part I 
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1»_ Thi b glossary Is designed primarily to be used in connection 
_ ,. .yith the text^ Mirftary Cryptanalysis Part I , It is limited in scoper 
ix> cryptologic terms actually appearing In the text, terms likely to 
1 "be encountered in other ^ cryptologic literature of approximately the 
"same level as the text,' and a few other terms' considered necessary to 
complement or \o clarify certain definitions . 



i -j. <,* ,2, The terms in F thls glossary are arranged in strictly alpha- 
* betical order, disregarding word' spaces and hyphens • Single words 
and certalp hyphenated words are followed directly by an abbreviation 
of the part of speech! Rim-on entries, Indicating a part of speech 
different from that of ihe main entry', are' shown simply by means of 



a series of dashes followed by the abbreviation of the part of speech, 

'definition! ^Abbreviations used for parts of 



and the appropriate 
speech, as well as those used to indicate examples, cross-references, 
etc., are those listed in Webster's Hew International Dictionary , 
Second Edition, ' 
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GLOSSARY FOR MILITARY CRYPTANALYSIS, PARE I 



accidental repetition . A repetition produced fortuitously, and not by 
-the encipherment of Identical plaintext letters by Identical keying 
elements* (Cf* causal repetition *) 

additive t n* A single digit, a series of digits, or a numerical group 
wEIch, for the purpose of encipherment, is added to a numerical 
code group or to numerical cipher or plain text* 

additive book * A book comprising a group of additive tables* 

additive system * A cryptosystem in which encipherment is accomplished 
through the application of additives* 

additive table . A tabular arrangement of additives* 

addressee* n* The office, headquarters, activity or Individual to vfaom 
a message la directed hy the originator* 



ADFOVX system * A German high-command cipher system used in World War I* 

Assent telly, a biliteral substitution system employing a 6 x 6 square, 
to which a columnar transposition vaa subsequently applied* 

appllqu/ unit, teleprinter * A special cipher attachment used in cqnnec- 
tion with" a teleprinter to provide cryptographic treatment for tele- 
printer messages. 

artificial ward . A group of letters having no real meaning, constructed 
Vy the' systematic arrangement of vowels and consonants 89 as to give * 1 
the appearance and pronounceahlllty of a hona fide word* 



Baconian cipher . A cipher system Invented hy Sir Francis Bacon (1561-1626) * 
it is basically a monoalphabetlc substitution system in which single 
plaintext letters are represented hy five -letter cipher equivalents 
formed by permutations of two letters taken five at a time* 

baud* n* A mark or space lx&pulse In the International (Baudot) teleprinter 
code* 



Baudot code . A five-unit code applied to teleprinter systems by Jean Maurice 
"Emile Baudot (1845-1903). It employe a 32-element alphabet composed 
of permutations of two elements taken five at a time* Also called the 
international teleprinter code . 

biliteral * adj. Of or pertaining only to cryptosystems, cipher alphabets, 
and frequency distributions which Involve cipher units of two letters 
or characters. See the more Inclusive term dlgrephlo j see also bl- 
literal frequency distribution * 
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billteral alphabet « A cipher alphabet involving a cipher component com- 
posecF of 'tvo-ebaracter units. 



- 'r j ‘V 

biliteral frequency distribution .' ~ A frequency distribution of pairs 
formed by combining successive letters or characters . Thus, a 
* biliteral distribution of ABCDEF would list the following pairs: 
AB, BC, CD, DB, E3T. (Cf. dlgraphic frequency distribution .) 



» , lb v ( * ' v 

bipartite alphabet . A billteral alphabet in which the cipher units may 
be divided into two separate parts whose functions are clearly 
- defined, viz., row indicators and column indicators of a matrix. 



bipartite system . A substitution system involving the use of a bipar- 
tite alphabet. 

blank-expectation test . See lambda test* . 1 ' 

bust message . A message containing an error in encipherment which 
Jeopardizes the cryptographic security of the message, and thus 
is potentially valuable to the cryptanalyst. 

,r *•«.- r- 

Caesar’s cipher . An, ancient form of simple substitution cipher in which 
each" plaintext letter was replaced by the letter three places to 
the right of it in the normal alphabet; attributed to Julius Caesar. 

v — 

call sign . A group of letters or numbers, or a combination of both, 
used as, the identification for a telecommunication station (or 
stations), when stations are establishing contact with each other. 

c «** * i ^ 

causal repetition . A repetition produced by ilie encipherment ""of identi- 
cal plaintext letters by identical keying elements. 

cell , n. An Individual small square oh cross-Sectlon paper, grilles, etc. 

characteristic frequency . Bee normal frequency . v 

chi-square" ( ~)t) table . A mathematical table listing tJie probabilities 
of occurrence by chance of a chi-square value higher than that 
observed in a given case; an adjunct to the chi-square test . 

■the rfela- ' 
same source. 

For example, the test can be used to aid in the determination of 
whether a distribution is more likely to be random or not; in this 
usage, the observed distribution is compared with a theoretical 
distribution representing that which is expected for random. The 
end result of the test is a value representing the discrepancy 
^.-between the two distributions which have been compared. This 
value, called a "chi-square value" may be interpreted as it is, 

-/or it may be Interpreted through the use of a chi-square table . 



chi-square (% ) test . A mathematical means for determining 
tive likelihood that two distributions derive from the 
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chi (*y) tost . A test applied to the distributions of the elements of 
~ two cipher texts elthar to determine whether the distributions ere 
the result of encipherment by identical cipher alphabets, or to 
determine vhether the underlying cipher alphabets ore related. 

Also called the cross-product test . 

clfax , n. Enciphered facsimile. The process of converting a piano image 
into on unintelligible imago or series of electrical impulses ond of 
reconverting it or them into intelligibility through the use of a 
key.— adj. Using or pertaining to cifax. 

cipher , n. 1, A cipher system. 2, A cryptogram produced by means of a 
cipher system. — adj. Pertaining to that vhieh enciphers or is en- 
ciphered. 

cipher alphabet . An ordered arrangement of the letters (or other con- 
ventional signs, or both) of a written language and of the characters 
which replace them In a cryptographic process of substitution. 

cipher clerk . A clerk who enciphers and deciphers messages. 

cipher component . The sequence of a cipher alphabet containing the symbols 
wEichropIace the plain symbols in the process of substitution. 

cipher device . A nonmechanical and nonelectrical apparatus used for 
enciphering and deciphering. 

cipher disk . A cipher device consisting of two or more concentric disks, 

7 each bearing on its periphery one component of a cipher alphabet. 

cipher machine . A mechanical or electrical apparatus for enciphering and 
deciphering. 

cipher square . An orderly arrangement or collection of sequences set 
forth "in a rectangular form, commonly a square (c.g., a Vigenkre 
square) 

cipher system . Any cryptosystem in which cryptographic treatment is 

applied to textual units of regular length , usually monographic or 
digraphic . (Cf. code system .) 

cipher text . The text of a cryptogram which has been produced by means of 

* a cipher system. 

clphony , n. Enciphered telephony. The process of converting vocal commun- 
ications into unintelligibility and of reconverting them into intel- 
ligibility through cryptographic treatment.— adj. Using or per- 
taining to ciphony. 

citrol, n. The process of converting control and telemetering signals, 
such as those used in missile guidance, into unintelligibility and 
reconverting them into intelligibility through cryptographic treat- 
ment.— adj. Using or pertaining to citrol. 
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clvlslon , n. Enciphered television. A system of converting television 
signals into unintelligible signals and vice versa, in accordance 
with certain predetermined procedures .-~-adj . Using or pertaining 
to civision, 

clear text . Plain text. ' * 

code , n. 1. A code Bystem. 2. A code hook.— adj. Pertaining to 

that which encodes or is encoded. 

* . ' - . - 

code book . A book or document used in a code system, arranged in syste- 
matic form, containing units of plain text of varying length 
(letters, syllables, words, phrases, or sentences) each accompanied 
by one or more arbitrary groups of symbols used as equivalents in 
messages * 

i 

code chart . A chart In the form of a matrix containing letters, syllables, 
numbers, words and, occasionally, phraseB. The matrix has row end 
column coordinates for the purpose of designating the plaintext 
elements within. 

code clerk . A clerk who encodes and decodes messages . “ 

code group . A group of letters or numbers, or a combination of both, 
assigned (in a code system) to represent a plaintext element. 

code message . A cryptogram produced by encodement. 

code system . A cryptosystem in which arbitrary groups of symbols represent 
plaintext units of irregular length, usually syllables, whole words, 

’ phrases and sentences. 

code text . The text of a cryptogram which has been produced by means of 
a code system. 

coincidence test . The kappa test. A statistical test applied to two 

ciphertext messages to determine whether they both involve encipher- 
ment by the same sequence of cipher alphabets. 

columnar transposition . A method of transposition in which the cipher- 
text equivalent of a message is obtained by transcribing the columns 
of a matrix into which the message was inscribed earlier according 
to some scheme other than this vertical one. 

column coordinate . A symbol normally at the top of a matrix or crypto- 
graphic table, identifying a specific column of cells, used in 
conjunction with a row coordinate to specify an individual cell in 
the matrix or table. Also called c olumn indicator . 

column indicator. See column coordinate. 
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communication intelligence (COMINT) . Evaluated and Interpreted Informa- 
tion derived from the study of intercepted communications* 

communication security (COMSEC) * The protection resulting from all 
measures designed to deny to unauthorized persons Information of 
value which might he derived from communications* Cryptoaecurlty , 
transmission security , and physical security are the components of 
communication security. 

commutative , odj. As applied to cipher matrices, so constructed as to 
permit coordinates to bo read in either row-column or column-row 
order without cryptographic ambiguity. 

component , n* One of the two sequences (plain and cipher) which compose 
a cipher alphabet. 

compromise , n. The loss of security of a classified document, information, 
or material, which results from the possibility of an unauthorized 
person or persons having knowledge thereof. 

concealment system . A method of secret communication so designed as to 
convey a secret message without its presence being suspected by 
others than the addressee. In its most usual form, the plaintext 
elements are concealed by combining them with extraneous plaintext 
elements in such a way that the end result is an intelligible and 
apparently innocent message. (Cf. open code .) 

crest , n. In its cryptologic application, a point of high relative fre- 
quency in a frequency distribution. 

* 

crib , n. 1. Plain text assumed or known to be present in a cryptogram. 

2. Keys assumed or known to have been used in a cryptogram*— v.t. 

To fit assumed or known plain text or keys into the proper position 
in an encrypted message. 

cross-product test . See chi test . 

cryptanalysis , n. The steps and operations performed in applying the 
principles of cryptanalytics . 

cryptanalyst, n. A person versed in the art of cryptanalysis. 

cryptanalytlc , adj. Of, pertaining to, or used in cryptanalytics. 

cryptanalytics , n. That branch of cryptology which deals with the prin- 
ciples, methods, and means employed in the solution or analysis of 
cryptosystems • 

cryptanalyze , v.t. To solve by cryptanalysis. 
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cryptogram , ' n. A communication in visible -writing -which conveys no 

intelligible meaning in any known language, or which conveys some 
meaning other than the real meaning. 

cryptographer , n. One who encrypts or decrypts messages or has a part 
in making a cryptographic system. 

cryptographic , adj. Of, pertaining to, or concerned with cryptography. 

cryptographic ambiguity . Uncertainty as to the method of decryption or 
as to the meaning intended after decryption; created by a fault In 
the structure of a cryptosystem. 

cryptographic arithmetic . The method of modular arithmetic used in 
cryptographic procedures which involves no carrying in addition 
and no borrowing in subtraction. 

- « •* . *■ „ 

cryptographic security . See cryptosecurity . 

- , t 1 » 

cryptographic system . See cryptosystem . 

- * ; i j ^ t ‘ - 

cryptographic text . Encrypted text; the text of a cryptogram. 

* - ' lr 

cryptography , n. That branch of cryptology which treats of the means, 
methods, and apparatus fpr converting or transforming plaintext 
messages into cryptograms, and for reconverting the cryptograms 
into their original plaintext form by a simple reversal of the 
steps used in their transformation, 

cryptologic, adj. Of, pertaining to, or concerned with cryptology, 

cryptology, n. That branch of knowledge which treats of hidden, disguised, 
or encrypted communications. It embraces all the means and methods 
of producing communication Intelligence and maintaining communication 
security; for example, cryptology includes cryptography, crypt- 
analytics, traffic analysis, eto. 

cryptomaterial , n. All documents, devices and machines employed in 
encrypting and decrypting messages, 

cryptomathematlcian , n. One versed in cryptomathematics. 

oryptomathematics i n. Those portions of mathematics and those mathematical 
ino'ihods' which have cryptologic applications. 

cryptosecurity , n. That component of communication security which 

results from the provision of technically sound cryptographic systems 
and from their proper use. 

cryptosystem , n. The associated items ox oryptomaterlal and the methods 
and rules by which these items are used as a unit to provide a 
single means of encryption and decryption. A cryptosystem enbraces 
the genera i cryptosystem and the specific keys essential to the 
errployment of tne "general cryptosystem* 
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cyclic, cdj. Periodic; continuing or repeating so that the first term 
of a sorloo follows the last; characterized by a ring or closed- 
chain formation, 

cyclic permutation . Any rearrangement of a sequence of elements which 
rearrangement merely involves shifting all the elements a con it on 
distance to the right or left of their initial positions in the 
sequence, the relative order remaining undisturbed; such a 
rearrangement requires that one consider the basic sequence as 
being circular in nature so that, for example, shifting that 
clement which occupies the left-most position in the sequence one 
place to the loft places this element in the right-most position, 

dally keying element . That part of the specific lcey which changes at* 
predetermined intervals, usually dally, 

decimated alphabet . An alphabet produced by decimation, 

decimation , n. The process of selecting members of a series by counting 
off at a chosen interval, tho original series being treated as 
cyclic; or the result of the foregoing process. 

«• i — 

decimation-mixed sequence, A mixed sequence produced by decimation. 

decipher , v.t. To convert an enciphered message into its equivalent plain 
text by a reversal of the cryptographic |?robc3S dbed in encipherment, 
(This does not include solution by cryptanalysis.) 

* * ir » * *.•» .* #» » i - 

» • ” * 

deciphering alphabet . A cipher alphabet in which the sequence of symbols 
^ in the cipher component is arranged in normal order for convenience 
in decipherment, 

. - «# t y * . v T1 r " ■“* 

decipherment , n, 1. The process of deciphering. 2. The plain text of 
a deciphered cryptogram. 3» In an enciphered code system, the code 

text resulting from the removal of the encipherment. 

\ 

decode, v.t. To convert an encoded message into its plain text by moans of 
a code book. (This does not include solution by cryptanalysis.) 

— n. 1. That section of a code book in which the code groups are in 
alphabetical, numerical, or other systematic order. 2. The decoded, 
but not translated, version of a code message. 

decodomont , n. 1. The process of decoding. 2. The decoded, but not 
translated, version of a cryptogram. 

__ j 

decrypt , v.t. To transform an unintelligible or cryptic communication into 
an intelligible one by a reversal of the cryptographic process used in 
onpryptment. (This does not include solution by cryptanalysis.)— n. 

A decrypted, but not translated, message. 

decryption , n. The act of decrypting. 

degnrble , v.t. To make emendations in a garbled text. 
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derived nnmciical Key * A key produced "by assigning numerical vulx^es to 
a selected literal key. 

diagnosis , cryptanolytlc ■ A systematic es^amination of cryptograms with 
a view to discovering the general system underlying these crypto- 
grams . 

digraph , n. A pair of letters. 

digraphic , edj. Of or pertaining to any combination of two characters. 

r *. l 

digrspMc frequency distribution . A frequency distribution of successive 
pails of letters or characters. A di graphic distribution of ABCDEF 
would list the pairs: AB, CD, EF, (Cf. b 3 literal frequency distri - 

bution .) 

dl graphic idlomorph . A plainiext or cipher sequence which contains or 
shows a pattern in its construction as regards the number and 
position of repeated digraphs. 

digraphic substitution . Encipherment by substitution methods in which 

the plaintext units are pairs of characters and their cipher equiva- 
lents usually consist of two characters. 

dinoma , n, A pair of digits. 

direct standard cipher alphabet . A cipher alphabet in which both the 
plain and cipher components are the normal sequence, the two com- 
ponents being Juxtaposed in any of the non-crashing placements. 

discriminant , n. A group of symbols indicating the specific crypto- 
system used in encrypting a given message . Also called sysbem 
indicator . 

distribution , n. See frequency distribution . 

doublet , n. A double-letter digraph, such as, LL , EE , etc. 

double tip exposition . A cryptosystem in which the characters of a 
first or primary transposition are subjected to a second trans- 
position. 

encicofle , n, A portmanteau word for enc lphored ' code . ’ 

encipher , v.t. To convert a plaintext 1 ' message into unintelligible 
'language by means of a cipher system. 

enciphered code . A cryptographic system in which a cipher system is 
applied to encoded text. 
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enciphering alphabet * A cipher alphabet in which the sequence of letteys 
xn the' pl^n coxiifonent is arranged in normal order for convenience 
in encipherment* 

encipherment , n« 1. The process of enciphering. 2. Text which has been 
enciphered. 

encoded cipher * The final text produced by enciphering the plain text and 
then encoding the enciphered text. 

encode, v.t. To convert a plaintext message into unintelligible language' 
by means of a code book, — -n. That section of a code book in which 
the plaintext equivalents of the code groups are in alphabetical, 
numerical, or other systematic order, 

encodexcent , n. 1. The act or process of encrypting plain text with a 
code system. 2. The text produced by encoding plain text, 

encrypt , v.t. To convert a plaintext message into unintelligible language 
by means of a cryptosystem. 

encrypted text . The text produced by the application of a cryptosystem to 
a plaintext message. 

encryption , n. l. The act of encrypting, 2. Encrypted text. ^ 

external text . In concealment systems, the apparently innocent enveloping 
text within which a secret message is hidden. 

four-level dinome cipher . A biliteral substitution cipher system employ- 
ing' four cipher sequences composed of two-digit numbers, by means K 
of which all or nearly all of the plaintext letters are provided 
with four two-digit variant equivalents . " 

four-squ a re matrix system . A digraphie substitution system employing a 
matrix which' usually consists of four 5x5 squares in which the 
letters of 25-element alphabets (usually combining I and J) are 
inserted according to any prearranged order. 

r r ^ » , * - — w 

fractionating system . A cipher system in which plaintext units are . 
represented by two or more "cipher symbols which in turn are dis- 
sociated and subjected to further encipherment by substitution or 
transposition or both. ^ A _ 

_ lr 

f ractionation , n. A cryptographic process where JLn the cipher symbols, 
which combined represent a plaintext unit, are dissociated and sub- 
jected to further encipherment. 
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frequency distribution . A tabulation of -l he frequency of occurrence 
of plaintext or ciphertext units in a message or a group of 
messages . A frequency count . 

frequent laD matrix . A type of cipher matrix providing variants. A 
matrix in which the ntmiber of different cipher values available 
to represent any given plaintext letter closely approximates lbs 
relative plaintext frequency. 

garble , n. An error in transmission, reception, encryption, or de- 
cryption which renders incorrect or un&ecryptable a message or 
transmission or a portion thereof. — -v.t. To make an error in 
transmission, reception, encryption, or decryption of a message. 

V , ' V J 

general cryptosystem . The basic invariable method of encryption 
included in a cryptosystem, excluding the specific keys 
essential to its^ employment . 

generatrix , n. In connection vith the method of completing the 
plain component sequence, any one of the rows, each of which 
represents a trial "decipherment 11 of the original cryptogram. 

* i • 

. * ' J f i 

Grandprc cipher . A type of substitution system providing variants. 

“““ Tliis system employs a cipher square in which are inscribed ten 
10-letter words containing al] the letters of the alphabet in 
their approximate plaintext frequencies. These ten words are 
further linked together hy a 10-letter word which appears 
vertically in the first' column ss a mnemonic feature for the 
inscription of the words in the rows . 

grid, n. In a transposition system, a form or matrix over which a 
grille is placed for the purpose of enciphering or deciphering. 

grille , n. 1. A sheet of paper, cardboard, thin metal, plastic, or 
like material In which perforations have been made for the uncov- 
ering of spaces In which textual units may be written or read 
on the grid. 2. A matrix In which certain squares are blocked 
out or otherwise marked so as not to be used. 

group , n. A number of digits, letters or characters forming a unit 
for transmission or for cryptographic treatment. 

hlgh-echelon , adj, Pertaining to organizational units at the army 
divisional level or higher, or their equivalents, 

high-grade , adj . Pertaining to a cryptosystem which offers a maximum 
of resistance to cryptanalysis; for example: (l) complex cipher 

machines, (2) one-time systems, (3) two-part codes enciphered 
with an additive book. (Cf . lcrw- grade sold medium-grade . ) 
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H113 *s algebraic encipherment . A true polygraphic system for the encipher- 
ment of polygraphs of any order, involving algebraic treatment by 
means of coefficients for the transformation of a plaintext poly- 
graph into its ciphertext polygraphic equivalent, and vice versa* 
Invented by Professor Lester S. Hill of Hunter College* 

hit, n« A coincidence or identity * 

horizontal two-square matrix system . A digraphic substitution system em- 
ploying a matrix -which normally consists of two 5 x 5 squares placed 
side by side. 

w - * „ 

identification , n. Determination of the plaintext meaning of a cipher 
element or code group, * _ 

v •* ” ,r r n 

identify , v.t. To determine the plaintext meaning of a cipher element or 
code group. , \ _ , ,, , . - ’ 

ldiomorph , n, A plaintext or cipher sequence which contains or shows a 

~ pattern in its construction as regards the number and positions of 
repeated letters. ' __ 

~ i - ; * * i 

idlomorphism , n. In a plaintext or cipher sequence, the phenomenon of 
showing a pattern as regards the number and positions of repeated 
letters. ‘ 1 ' ~ ' pu 

r c t 1 

index of coincidence . The ratio of the observed number of coincidences in 
a given cryptogram to the number of coincidences expected in a sample 
of random text of the same size as the cryptogram. 

Indicator , n. In cryptography, an element Inserted within the text or 
heading of a message which serves as a guide to the selection or 
derivation and application of the correct system and key for 
the prompt decryption of the message. See also the more precise 
terms discriminant and message Indicator . 

inscription , n. In a transposition system, the process of writing a mes- 
sage into a matrix. 

Integer, n. A whole number. 

intercept , v.t. In its cryptologic application, to gain possession of 
communications which are intended for other recipients, without 
obtaining the concent of the addressees and without preventing or 
ordinarily delaying the transmission of the communications to those 
addressees.— n. A copy of a message obtained by interception. 

interception, n. The process of gaining possession of communications in- 
tended for others without obtaining the consent of the addressees 
and without preventing or ordinarily delaying the transmission of 
the communications to those addressees. 
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Internal text . In concealment systems, the secret text which Is envel- 
oped by open or apparently Innocent text. 

International teleprinter code * See Baudot code. 

Interrupted-key columnar transposition , A columnar transposition system 
In vhlch the plaintext elements are inscribed in a matrix in rows of 
irregular length as determined by a numerical key. 

Inverse four-square matrix system . A four-square matrix system in which 

the cipher sections contain normal alphabets while the plain component 
sections contain mixed alphabets. 

Invisible writing . Writing not visible to the naked eye; the characters 
composing such writing may be microscopic or inscribed with invisible 
ink. 

Isolog, n. A cryptogram of which the plain text is identical with that of 
another message encrypted in another system, key, code, etc. 

isologous , adj . Pertaining to or having the nature of an Isolog. 

Jefferson cipher . A polyalphabet ic substitution system Invented by Thomas 
Jefferson and independently at a later date by the French cryptographer 
Bazeries. It provided for encipherment by means of a manually operated 
device Involving a number of revolvable disks, each bearing a mixed 
alphabet on its periphery. 

kappa plain constant . A constant employed in coincidence tests to denote 
the'' probability of coincidence of a given textual element or unit In 
plain text. It is the sum of the squares of the probabilities of 
occurrence of the different textual elements or units as they are 
employed in writing plain text; for example, in English telegraphic 
plain text, the monographic and digraphic kappa plain constants are 
♦0667 and .0069 respectively. 

kappa random constant . A constant employed in coincidence tests to denote 
the probability of coincidence of a given textual element or unit in 
random text. It Is merely the reciprocal of the number of different 
elements or units of which the cipher text may have been composed; 
if a 26-letter alphabet were employed, for Instance, the constant 
denoting the probability of coincidence of various textual elements 
would be derived as follows; 

a. single letters l/2 6 « .0385 

b. digraphs 1/676 * .00148 

c. trigraphs l/l757 6 ■ .OOOO57 

kappa test . See coincidence test . 

key, n. 1. In cryptography, a synibol or sequence of symbols applied to 
successive textual elements of a message to accomplish their encryp- 
tion or decryption. 2. A specific key. 
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key 'book * A book containing key text, or plain text forming specific keys. 

— - *r 

keyed columnar transposition . A transposition system in which the columns 
of a matrix ore token off in the order determined by the specific key, 
which. Is often a derived numerical key. ’ 



key phrase . An arbitrarily selected phrase from which a key Is derived. 

n i l i i 

key recovery . The cryptanalytie reconstruction of a key. 

key text . Text from which key is derived* ~ - - 

- e ri. 

key word . An arbitrarily selected word used as a key per se , or from which 
a key is derived. 

f K r ’ i J ■* « f 

keyword-mixed alphabet . An alphabet constructed by writing the prearranged 
key word or key phrase (repeated letters, if present, usually being 
omitted after their first occurrence), and then completing the se- 
quence from the unused letters of the alphabet in their normal sequence 

I "*'■ » " * p i * i 

lambda (A) test . A test for monoalphabeticity in a message, based on a 
comparison of the observed number of blanks in its frcqufefccy distri- 
bution with the theoretically expected number of blanks both xn 
(a) a normal plaintext message of equal length and (h) a random 
assortment of an equal number of letters. Also called the blank- 
expectation test . " * »T7TTT~ 

4 j. ' Ut ' V lll^t ^ | - 

latent repetition . A plaintext repetition not apparent in cipher text 
but susceptible of being made patent as a result of analysis. 

n, - *• •»■*'». r *r - , r * r •yeit f 

Latin square . A cipher square in which no row nOr column contains a 
repeated symbol. - ■ ** » ’* ' 

"!■ « 4 V > 1/ | ^ ‘ 1 If » 4 4 / “ 

lexical * adj. Of, pertaining to, or connected with words . In its crypto- 
logic sense, the word is used to characterize those cryptographic 
methods (chiefly codes) which deal with plaintext elements comprising 
complete words, phrases and sentences. 

r - _ » .J* _t * v; I — at - 1 - “ erf 

literal key . A key composed of a sequence of letters. * 

r * - \. g ~ ~ f t. r - , T • r 

logarithmic weights , numerical weights assigned to units of plain text, 
which weights are actually logarithms of the probabilities of the 
plaintext units, and which are used to evaluate the results of certain 



cryptanalytie operations. 



nf 



low-echelon , adj. Pertaining to organizational units below the level of 
the army division or its equivalent. 



low-grade , adj . Pertaining to a cryptosystem which offers only slight 
resistance to cryptanalysis; for examples (l) Playfair ciphers, 
(2) single transposition, (3) unenciphered one-port codes. (Cf. 
medium-grade and high-grade) . 
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matrix , n. A geometric form or pattern. In transposition ays toms, 
the figure or diagram in which the various steps of the trans- 
position arc effected; in substitution systems, the figure or 
diagram containing the sequence or sequences of plaintext or 
cipher symbols. 

modlixm-gradc , adj. Pertaining to a cryptosystem which offers con- 
siderable resistance to cryptanalyais; for example; (l) strip 
ciphers, (2) polyphase transposition, (3) unenciphered two- 
part codes. (Cf. low-grade and high-grade ) * 

message, n, Any thought or idea expressed in plain or secret lan- 
guage, prepared in a form suitable for trans'misslon by any 
means of communication. 

message indicator . That part of the specific key which changes with 
every message. 

message keying element . See message indicator . 

mixed cipher alphabet . A cipher alphabet in which the sequence of 
letters or characters in one or both of the components is not 
the normal sequence, 

mixed-length system . A cryptosystem in which the units of cipher 
text or code text are of Irregular or non-constant length, as 
for example, a monome-dinome system, or a code system employ- 
ing both 4- letter and 5 -letter groups. 

mnemonic key . A key so constructed as to be easily remembered. 

modulo , adj. Pertaining to a cyclic scale or basis of arithmetic. 
(Abbreviated as mod ; e.g., mod 10, mod 26, etc.) 

modulus , n. Scale or basis of arithmetic; the number n is called 
the modulus when all numbers which differ from each other by 
n or a multiple of n are considered equivalent, 

monitor, v.t. To intercept and copy one's own or friendly radio and 
wire transmissions for the purpose of detecting and correcting 
violations of regulations. 

- a. 

monoalphabeticity , n. A characteristic of encrypted text vhich indl- 
cates that it has been produced by methods involving a single 
cipher alphabet or single code book, unenciphered. It is 
normally disclosed by frequency distributi ons which display 
"roughness", or pronounced variation in relative frequencies. 
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HKWOrtJ p tio ty h Ip _ nub at itutlon , A typo Of substitution 'employing a sing! e 
cipher alphabet 'by means of which oacli cipher equivalent , eoirtpon?3^ c * 
of ones or more demon fcs , invariably represents one particular plain- 
text uni b , wherever it occurs throu^tiout any given message* 

r 

jQonogfftphloi adj « of or pertaining to any units comprising single cliarae- 
tors » 

L 

jiiouogi ayh lc aubst ltut ion » ftneipherment by substitution methods in wh£eh 
the plaintext' units are Bittglo characters and their cipher equiva- 
louts usually consist of single characters* 

moftoma * ft* A single digit* 

^norca-dinome system . A substitution system in which certain plaintext 
elements’ have single-digit cipher equivalents, while Others, are ^ 
represented by pairs of digits • 

multiliter nl , adj* of or pertaining only to cryptosystems, cipher alpha- 
"" ^ ’ bcis/ aiid frequency distributions which involve cipher units of two 
or more letters or characters *_ _S.ee the more inclusive term, poly- 
ginphic , 

multiliteral cipher alphabet , A cipher alphabet in which one plaintext 

" " f letter is’ represented by cipher units comprising two or more elements * 

aultillterfll system . A substitution system involving one or more multl- 

* literal" cipher alphabets, _ 

4 

multiple alphabet system , A type of substitution in which successive 

lengthy ^ovtilona of a message are each mo no alphabetically enciphered 
by a different alphabet j monoaiphabetlc encipherment by sections, 

non-carrying sura * A s\sa produced in cryptographic (mod 10) arithmetic* 

non-crashing , adj, A term used to describe that feature of the structure 
of certain cryptosystems which does not permit a plaintext unit to be 
self-enciphered, 

- ^ 1 

non-commutat ive , adj. As applied to bipartite cipher matrices, so con- 
structed that row and column coordinates must be read in a certain 
prescribed order (for example, in a rov-coluan order), 

— w _* * 

normal frequency . The standard frequency of a plaintext unit or letter 
relative to other such units or letters, as disclosed by 'the statis- 
tical study of a large volume of text, „ - 

~ ■* < t * 

normal sequence . The normal alphabetical sequence of those letters which 
are used in the written text of any particular language, or any cyclic 
permutation thereof. 
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norm! unllltoral frequency distribution . A distribution showing 
tbo standard relative frequency of single plaintext symbols as 
disclosed by statistical study of a lorge ■volume of text. 

null, n. In cryptography, a symbol or unit of encrypted text having 
no plaintext significance. 

numerical key . A key composed of a sequence of numbers. 

numerically-keyed columnar transposition . A transposition system in 

1 , * which tte^colvurins of a matrix are taken off in the order deter- 
mined by a numerical key. 

off the cut . As .applied to the division of cipher text into poly- 1 
graphs, beginning elsewhere than with the initial character of 
a hona fide polygraph. 

rf * - 4 „ 

one-part code . A code in which the plaintext elements arc arranged 
in alphabetical or numerical order accompanied by their code 
groups also arranged in alphabetical or numerical order. 

one-time pad . A form” of key book used in a one-time system so de- 
signed as to permit the destruction of each page of key as 
soon as it has been used. 

one-tlm3 system . A cryptosystem in which the key, normally of a 
random nature. Is used only once. 

on the cut . As applied to the division of text into polygraphs, be- 
ginning with the first textual character. 

open code . A cryptosystem in which units of plain text are used as 
the code equivalents for letters, numbers, words, phrases or 
sentences. The code equivalents themselves, usually words or 
phrases, can be combined to form the intelligible text of 
apparently innocent messages. (Cf. concealment syst em.) 

originator, n. The individual (a commander or his officially desig- 
nated representative) by whose authority a message is sent. 

padding , n. Extraneous text added to a message for the purpose of 
concealing its length and beginning or ending or both. 

paraphrase , v.t. To change the phraseology of a message without 
changing its meaning. 

partlally-polygra'phic system . Any polygraphic substitution system 
' in which the encipherment of certain members of the poly- 
graphs shears group relationships; small matrix systems, such 
as the four-square, tiro-square and Playfair syt»t- 13 involve 
such group relationships and are considered to 1 e partially- 
digrapliic syct^s . 
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par t II Ion, n. Becolulion of on Integer into a set of integers (c.g., 
rtprcficni otlon of the integer 6 as 1 and 5> 2 and b f 3 and 3) • 

~ J 1 I < V> | ' f 1* i ^ 

pnlonl repetition * A repetition which is oicternally visible in tho 
original cryptographic text* 

, - n If * r ~ r I % . *"* -* i m ** 

ponto graph , n. A set of five letters. ’ * rr 



pentarowo , n. A set of five digits. 



periodic substitutio n . Periodic polynlphabctle substitution. A method 

of cncipEcrraonl involving tho cyclic use of a plurality of al phabets . ^ 

permutation table . A tablo designed for the systematic construction 
codo groups. It may also be used to conect garbles in groups of 
code text. * * - •* *' ~ * T ' t> r *” *-*• * • u it * 

t\ * f n i % «' 



phi ( <&) tost. A test appli ed to a frequency distribution to doterpixi l c]^bs 
relative monoalphaboticity. See also kappa plain" constant and^ kapp a’ 1 
random constant. * "* * 






phy sic al securi ty. That component of communication security which results 
~froxsi all physical measures necessary to safeguard Classified contorani- 
cation equipment and material from access thereto by unauthorized 
persons. r * 



placode, n. 
"code. 



A portmanteau word usccl to designate plain or uncnciphorcd 

■» i ■» ' • 



plain code . Unenciphered code. 




*> 






__ That component of a cipher alphabet which comprises the 

sequence of plaintext symbols, - ^ 



plain compo ne nt eq u ivale nts. In connection with the method of completing 
tiie plain componeiii'lseqvcnce, the plaintext equivalents for cipher 
units derived from on arbitrary juxtaposition of the components of a 
cipher alphabet. 



plain languag e. Plain text, ' - * 



plain tex t ( cl ecr -tex t) . 1. Text or language which conveys an Intelligible 

mooning in tSe^language in vhich it is written, with no hidden meaning. 
2. The intelligible text underlying a cryptogram. 



system . A type of digraphic substitution using a single matrix 
normally of 25 cells. 
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Poisson table . Table of the Poisson, distribution* A type of math- 
cnAtical table containing probability data applicable to the 
phenomena of repetitions expected to obtain in samples of random, 
text; used in cryptanalysis to determine vhether or not the 
repetitions observed in a given sample of cryptographic text are 
causal repetitions or accidental (random) repetitions* 

polyol phabot ic substitution . A type of substitution in which the 
successive plaintext elements of a message, usually single 
letters, are enciphered by a succession of different alphabets 
which may bo used more than once and which are used in a 
predetermined order# 

polysrerhic , ad3« Of, pertaining to, or connected with any group- 
ings comprising two or More letters or characters* 

** 

polygraohic substitution * Encipherment by substitution methods in 
which" the plaintext units are regular length groupings of more 
than one element* 



polyphase cnciphcrrent . Any system of encryption involving two or 
more successive operations of cnciphcment. 

probable word . Plain text assumed or known to be present in a crypto- 
gram. A crib. 



probable-word method * The method of solution involving the trial of 
plain text assumed to be present in a cryptogram. 

- i 

proforma message . A message in standardized form, designed to convey 
intelligence by conventions of arrangement and abbreviation. 



pseudo-code system . A cipher system which produces a cryptogram whose 
groups resemble those produced by a code system. 

pseudo-polvgrrphic system . A polygraphia substitution system in which 
at least one of the ' letters in each polygraph is enciphered mono- 
aJLphahetically. ~ " 



quinqueliteral alphgbet . A cipher alphabet in which each plaintext 
, , fetter is. represented by a 5-character equivalent. 

1 -r 4 ~ ^ ^ A. - ket i ' 4 * i , 

random-mixed cipher alphabet . A cipher alphabet in which the letters 

carprising the" plain or cipher component have been mixed at random. 
(Cf. systematically-mixed cipher alphabet ). 

random text . Text which appears to have been produced by chance or 
accident, hsvipg no discernible patterns or limitations. 

'“►a -» - - - » 
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rapid analytical machinery . Any high-speed cryptanalyblc machinery, ( 

usually" electronic or photoelectric in nature. * ~ “ 

ft. r t. 

> 

raw traffic . Intercepted traffic showing no evidence of processing for 
communication intelligence purposes "beyond sorting by clear address 
elements, elimination of unwanted messages, and the inclusion 0 $ a 
case number snd/or an arbitrary traffic designator. 

reciprocal cipher alphabet . A cipher alphabet in which either of the tyo 
sequences may serve as plain or cipher since the equivalents exhibit 
reciprocity. * ' ’ 

reciprocity , n. As used in cryptology, interchangeability of plain-cipher 
relationships (e.g., Ap = B c and Bp = Ac). 

* * * ~ * T 

related alphabets . Any of the several secondary cipher alphabets which 
are produced by sliding any given pair of primary components against 
each other. 

( 1- * 

relative code . Code text from which an cncjphorirent has been removed in 
relative terms but not reduced to plain-code text, so that the groups 
differ from the actual, original plain code by an interval constonb 
for every group; thus the difference between two relative code groups 
is the same as that between their plain-code equivalents. 

repeatlng-key method . See periodic substitution . 

repe bitive encipherme nt . A typo of encipherment in which the primary cipher 
texFof a cryptogram is subjected to further encipherment with either 
the sems or a different system. Double transposition is a frequently- 
encountered example of repetitive encipherment • v 

t X' < 

rever s ed standard cipher al p habet . A cipher alphabet in which both the 
"plain end cipher coiuroncn t3 ere the normal sequence, the cipher com- 
ponent being reversed in direction from the plain fedmfconent. 

reversibility , n. That characteristic of the relationship betoieen a plain- 
text' digraph and its cipher digraph equivalent which permits the 
elements of each to be reversed (e.g., ABp » CDe and BAp » DC c ). 

revolving grille . A type of grille in which the apertures are so distri- 
buted that when the grille is turned successively through four angles 
of 90 degrees and set in position on the grid, all the cells on the 

grid are disclosed only once. Also called rotating grille.' 

— ■” rni T— 

rotating grille . See revolving grille . „ : 

rotor , n. A disk which is designed to rotate within a cipher machine and 
which controls the action of some other machine component or produces 
a variation in some textual or keying element. 
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rg^ Tmofle , n. That characteristic of a frequency distribution where 
~Thcre is displayed in the distribution a pronounced v^ri'-tion in 
relative frequencies of the elements considered. (Cf. s )othncno *) 

route transp o si-Hon . A method of transposition in which the cipher- 
text equivalent of a message is obtained by transcribing, accord- 
ing to any prearranged route, the cells of a matrix into rhich 
the message was inscribed earlier according to sone other pre- 
arranged route. 

row coordinate * A symbol normally at the 63 de of a matrix or crypto- 
graphic table, identifying a specific row of cells, used in con- 
junction with a coluiin coordinate to specify an individual cell 
in the matrix or tab] c . Also called row lndicctor . 

row indicator. See row c -• ordinate « 



running dinoro distribution * A biliteral distribution mede on digit 
text. 

secret ink . Any of several chemicals used for writing or printing which 
have the property cf being initially invisible to the n^ked eye or 
of becoming so after a short time. Also called invisible ink or 
sy pathetic ink. 



secret la^ ^uiage. Text which conveys no intelligible meaning in any 
l^ga-ge or which conveys an intelligible moaning that is not 
the real, hidden meaning. 

secret siting . 1. Visible siting in secret language. 2. Invisible 
writing. 

separator , n. See word separator . 

sequence, n. " An ordered arrangement of symbols (letters, digits, etc.) 
having continuity. Specifically, the members of a component of a 
cipher alphabet in order; the symbols in a row, column, or diagonal 
of a cipher square in order; key letters or key figures in order. 

r 

setting, n. The arrangement and alignment of the variable elements of 
a cryptographic device or machine at any moment during its opera- 
tion. ' ' - 

elpr^ c.n. As used in cryptomathematics, a measure of the standard 
deviation from normal, expressed in terms of sigma ( <5*) . 

simple substitution . Monoalphabetic unillteral substitution. 

simple transposition . See single transposition . 
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single transposition . A transposition In which only one Inscription and 
one transcription are effected, " - 

•*- i " k 

smoothness , n. That characteristic of a frequency distribution where there 
is displayed in the distribution no pronounced variation in relative 
frequencies of the elements considered* (Cf • roughness ) n 

solution , n. In its cryptanalytic application, the process or result of 
solving a cryptogram or cryptosystem by cryptanalysis, 

solve , v.t. To cryptanalyze . To find the plain text of encrypted communi- 
cations by cryptanalytic processes, or to recover by analysis the keys 
and the principles of their application, 

specific key . An element which is used with a specific cryptosystem to 

"*”*'*' determine the encipherment of a message and which includes both the 
me ssage keying element and the dally keying element . It may consist 
of a letter, number, word, phrase, sentence, a special document, book, 
or table, etc., usually of a variable nature and easily changeable 
at the will of the correspondents, or prearranged for them or for 
their agents by higher authority. 

square, n. See matrix , 

standard cipher alphabet . A cipher alphabet in which the sequence of 
letters in the plain component is the normal, and in the cipher 
component is the same as the normal, but either reversed in direction 
or shifted from its normal point of coincidence with the plain com- 
ponent. 

standard uniliteral frequency distribution . See normal unlliteral fre - 
quency distribution . 

stereoty pe, n. A word, number, phrase, abbreviation, etc., which as a 
result of language habits, has a high probability of occurrence, 
especially at the beginning or ending of a message. 

• .* i 

stereotyped massages . Related encrypted messages which are recognizable 
rs such because of distinctive characteristics of the underlying 
plaintext. **- „ 

^ ~\§ ~ 2 , * ~ ~ 

strip cl ub' p device . A cipher device employing sliding alphabet strips. 

substitution al phabet . See cipher alphabet . 

*' fc ~ ^ u i 

substitution cipher . 1. A cipher system in which the elements of the plain 
text are replaced by other elements . 2 . A cryptogram produced by 
enciphering a plaintext message with a substitution system. 

substitution system . A system in which the elements of the plain or code 
text ore replaced by other elements. 
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* fl m A> t A prose] cc bod digit (normally the final digit) 
hi - cc « 01 ci;licr g^onp i ’itch in the no^-cerTying cum of 
tlv> other digits an the group. 

snrinipg-trirora tv "»» , A substitution nyatem in which c-'ch pi •"in- 
text letter in '^olgncd *» unique nr cried vc” ’O of 0 to ?t» 
Thas volne is then cxprc 3 scO es a trinone, tiiO digits of vluch 
sum to tho designated value of the letter. 

r 

suporcnci pherment , n. A fom of nuperencryption in which tho final 
step involves encipherment , — v.t. Pnp^rcnca pher . 

Bttpcrencrypta on, n. A further encryption of the text of a cryofco- 
gr on for increased. security. Enciphered code is a ftrtqu 'ntly- 
encourvtorcd example of superencrypt Ion , — v .t . gup^x^ncryrit . 

switch group . A group used within a message to indicate that the 
following textual elements are encrypted with a different key 
or code hook. 



syllabary, n. In a code took, a list of individual letter's, combi- 
nations of letters, or syllables, scconpcnied by th^lr equiva- 
lent code groups, usually provided for spelling out words or 
proper names not present in the vocabulary of a code; a spell- 
ing table. 



syllabary square . A cipher matrix containing individual letters, 
digits, syllables, frequent digraphs, trigraphs, etc., which 
are encrypted by the row and column coordinates of the matrix. 



syllabi c, adj. Of, pertaining to, or denoting syllables. 

— i ^ S •* 



system, n. See cryptosystem . 



systemt leal l y-mlxcd cipher alphabet . A cipher alphabet in which 
tEe corporent tSat 15" maxed TIao been disarranged by systematic 
procedure. (Cf . rpp dom-iriied cipher alphabet ) - 



system indicator . See discrjminant . 



teleprinter, n. An electrically-operated instrument resenibling a 
typem iter, used for the transmission and reception-printing 
of messages by electrical means. Also called teletypewriter . 

t * 

teletypewriter , n. A teleprinter . 



tetragraph, n. A set of four letters. 






tetr gnome , n. A set of four digits. 
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text , n. She part of a message containing the basic information which, the 
originator desires to ho ccnrmunicotcd* * 

. 

traffic* n. Ml transmitted comnmlcations. 

■■■■■■■■ i 

i 

traffic analysis . That branch of cryptology which, through a study o? ”* 
signal transmissions by all means short of cryptanalysis of message 
texts, assembles information eoncomang communication networks* This 
information is used (l) as a guide to further interception; (2) as an 
aid to cryptanalysis; (3) as a source of intelligence even In the 
absence of decrypted message texts; and (4) to strengthen our own 
security hy discovering weaknesses in our communications and by 
avoiding weaknesses discovered m the communications of others* 

traffic intercept * A copy of a c omm unication obtained, through interception. 

transcription* n. In fe tiansposition system, the process of removing the, 
text from a matrix or grid by a method or route different from that * 
used in the Inscription* 



transmission security . That component of communication security which 
results' froa al3' measures designed to protect transmissions from 
Interception end traffic analysis, . . u * . * * * 

' * T * - p « 1 r „ f 

transparency, direct * That characteristic of cipher text which indicates 
" ' T&srfc" 'certain plaintext elements may have been self-enciphered* 

_ * i « “■ r -»l • ■■ ft. I* 

transparency* inverse . That character* stlfe of cipher text which indicates 
that certain cipher digraphs, may he merely reversals of the correspond- 
ing plaintext digraphs* 

a c • • - , ^ri - • ( t 

transposition cipher . 1* A transposition system* 2* A cryptogram pro- 
duced by enciphering a message with a transposition system* 

transposition-mixed cipher alphabet . A cipher alphabet in which st least 
one coiiiponcni"l[plain or clpFef ) has been constructed by 'applying a 
form of transposition to either a standard or a mixed sequence* 



transposition system . A cryptosystem in which Uiejalements of plain text^ 

whoxher inttividvt al letters, Groups of letters, "syllables, words, phrases, 
sentences, or code groups or their components undergo some change In 
thoir xelatlve positions without a change in their identities* • 

** ”-i « ■» . * ' ' n 

m “ ■“ ■“ — m mm — 

tri graph, n* A Bet of three letters* 

z *. _ » j- r- 



tiigrotiihic, adj. Of or pertaining to any three-character group. 



hir^g phic f ieouancy distribution . A frequency ^distribution of succossive 
” ” tri crop* 3f ti j graphic frequency disfviwtion of AECECT rowld. gqn- , 

aider cly t*i ■'iigroTiihs A3P end PTF * (Of. taili t g»vl l ioo vcrcy distrl- 
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trlffrephic sidbatitution system , A substitution system In which the 
plaintext units ore composed of three elements • 

triliteral, adj, Of, or pertaining only to cryptosystems, cipher 
alphabets, and' frequency fiistrlSutions which involve ci pher 
units of three letters or characters. See the more inclusive 
term trigraphic ; see also triliteral frequency distribution . 

triliterol frequency distribution . A distribution of the characters 
In the text of a message in sets of throe, which Trill show: 

(a) each character with it 3 two preceding characters or (b) 
each character with its two succeeding characters, or, in its 
most usual form, (c) each character with one preceding and one 
succeeding character. A triliteral frequency distribution of 
A3BCPSF would consider the groups ABC , BCD, CCS , DBF . 

trinome, n. A set of three digits. 

trlnome-dlgraphic system . A substitution system in which plain- 
text digraphs are represented by 3-digit cipher elements, 

trough, n. In its cryptologic application, a point of low relative 
frequency in a frequency distribution. 

true polyg r aphia system . Any polygraphic substitution system in which 
the individual elements of the polygraphs display no evidence 
of Eonoalphabeticity, nor evidence of relationships within any 
group of polygraphs; that is, in a true polygraphic system, chang- 
ing one letter in any plaintext polygraph affects the equivalent 
ciphertext polygraph in its entirety. (Cf . partially-polygrephie* 
system and pscrdo-polygraphic system . ) ~ 

two-element differential . The characteristic incorporated in certain 
codes in which the groups differ from one another by a minimum 
of two elements, either in Identity or the positions occupied. 

■. When the elements are letters, the characteristic is called a 
two-letter differential ; when the, elements are digits, it is 
called a two-digit differencial . 

two-part code. A randomized cods;, consisting of an encoding section 
in which the plaintext groups are arranged in alphabetical or 
other significant order acbctapanicd by their code groups arranged 
, in a non-alphabetlcal or random order; and a decoding section, 
in which the code groups are arranged in alphabetical or numeri- 
cal order and are accompanied by their meanings as given in the 
encoding section. 

two-square matrix system . A digraphic substitution system which 
no.rn.lly enploys a matrix consisting of two 5x5 squares 
arranged either horizontally or vertically. 
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uniliteral , adj. Of, or pertaining only to cryptosystems, cipher alphabets 
and frequency distributions which involve cipher units of single* ‘ 
letters or characters. See the more inclusive ter® monographic ; see 
also uniliteral frequency distribution . 

* r- ?• 

unlliteral frequency distribution . A simple tabulation showing the fre- 
quency ’ of! ndividual characters of a text. 

unlliteral substitution . A cryptographic process in which the individual 
letters of a message text are replaced by single-letter cipher 
equivalents . 

variant , n. 1. One of two or more cipher or code symbols which have the 
same plain equivalent; also called variant value . 2. One of several 
plaintext meanings which may be represented by a single code group. 

variant system . A substitution system in which some or all plaintext 
letters may he represented by more than one cipher equivalent. 

variant value . See variant . 

vertical two-square matrix system . A digraphic substitution system employ- 
1 ing a matrix" ^whicfa normkLly consists of two 5*5 squares arranged 
vertically. 

Vlgenere square . The cipher square commonly attributed in cryptographic 
literature to the French cryptographer Vlgen&re, having the normal 
sequence at the top (or bottom) and at the left (or right), with 
cyclic permutations of the normal or other sequence forming the suc- 
cessive rows (or columns) within the square. 

1 A * , • , 

visible writing . Writing in which the characters are inscribed with ordi- 
nary writing materials and can be seen with the naked eye . (Cf • invis - 
ible writing .) , >: 

Wheatstone cipher device . A cipher device consisting essentially of two 
rings counted concentrically in! a single plane, .the outer (and larger) 
ring being the plain component of the device and comprising 27 equl- 
slzed divisions, the inner (and smaller) ring being the cipher com- 
ponent, comprising 2 6 similar divisions. The device incorporates two 
hands (similar to those on a dock) pivoted at the center of the de- . 
vice— the larger hand serving the outer ring and the smaller hand, 
the inner— so geared together that for each complete revolution of the 
larger, the smaller turns through one complete revolution plus one 
twenty-sixth. 

word pattern . The characteristic arrangement of repeated letters in a word 
which tends to make it readily identifiable when enciphered monoalpha- 
hetically. 
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word separator . A unit of one or acre characters employed In certain 
cryptosystems to indicate the space between words « It may be • 
enciphered or unenclphered* Also called a word spacer . 

word transposition * A cryptosystem In which whole words are trans- 
posed according to a cert alp pre a rranged route or pattern* 
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ENGLISH CRYPTANALYTIC DATA 
FREQUENCY TABLES 

Tkbl« No Pais 

1-A Absolute frequencies of letters appearing in five sets of Governmental plain-text telegrams, each set 

containing 10,000 letters, arranged alphabetically 4 

1-B Absolute frequencies of letters appearing m five sets of Governmental plain-text telegrams, each set 

containing 10,000 letters, ar-anged according to frequency 6 

1- C Absolute frequencies of vowels, high-frequency consonants, medium-frequency consonants, and low- 

frequency consonants appearing m five sets of Governmental plain-text telegrams, each set con- 
taining 10,000 letters 6 

2- A Absolute frequencies of letters appearing m the combined five sets of messages totalling 60,000 letters, 

arranged alphabetically 6 

2-B Absolute frequencies of letters appearing m the combined five sets of messages totalling 60,000 letters, 

arranged according to frequency 6 

2-C Absolute frequencies of vowels, high-frequency consonants, medium-frequency consonants, and low- 

frequency consonants appearing m the combined five sets of messages totalling 60,000 letters 6 

2-D Absolute frequencies of letters as initial letters of 10,000 words found m Governmental plain-text tele- 
grams (1) Arranged alphabetically and (2) arranged according to frequency 6 

2-E Absolute frequencies of letters as final letters oi 10,000 words found in Governmental plam-text tele- 
grams (1) Arranged alphabetically and (2) arranged according to frequency 7 

8 Relative frequencies of letters appearing in 1,000 letters based upon Table 2-B (1) Arranged alpha- 

betically, (2) arranged according to frequency, (3) vowels, (4) high-frequency consonants, (6) me- 
dium-frequency consonants, and (6) low-frequency consonants 7-8 

4 Frequency distribution for 10,000 letters of literary English, as compiled by Hitt (1) Arranged 

alphabetically and (2) arranged according to frequency _. 8 

6 Frequency distribution for 10,000 letters of telegraphic English, as compiled by Hitt (1) Arranged 

alphabetically and (2) arranged according to frequency 8 

6-A Frequency distribution of digraphs, based on 60,000 letters of Governmental plam-text telegrams, 

reduced to 6,000 digraphs 9 

6- B Frequency distribution of digraphs (naval text) based on 20,000 letters of naval text, reduced to 

2,000 digraphs 10 

7- 11 Absolute frequencies of digraphs, trigraphs, and tetragraphs and the logarithms of their as- 

signed probabilities 11-88 

7-A The 428 different digraphs of Table 6-A, arranged according to their absolute frequencies, accom- 
panied by the logarithms of their assigned probabilities — 18-16 

7-B The 18 digraphs composing 26% of the digraphs in Table 6-A, accompanied by the logarithms of 
their assigned probabilities, arranged alphabetically according to their initial letters (1) and ac- 
cording to their final letters (2) and according to their absolute frequencies 16 

7-C The 63 digraphs composing 60% of the 6,000 digraphs in Table 6-A, accompanied by the logarithms 
of their assigned probabilities, arranged alphabetically according to their initial letters (1) and ac- 

'' cording to their final letters (2) and according to their absolute frequencies... 16 

7-D The 122 digraphs composing 76% of the 5,000 digraphs in Table 6-A, accompanied by the loga- 
rithms of their assigned probabilities, arranged alphabetically according to their initial letters (1) 

and according to their final letters (2) and according to their absolute frequencies 17-18 

7-E All the 428 digraphs of Table 6-A, arranged first alphabetically according to their initial letters 

and then alphabetically according to their final letters 18 

8 The 428 different digraphs of Table 6-A, arranged first alphabetically according to their initial 
letters and then according to their absolute frequencies under each initial letter, accompanied by 

the logarithms of their assigned probabilities 19-21 

9-A The 428 different digraphs of Table 6-A, arranged first alphabetically according to their final 
letters and then according to their absolute frequencies, accompanied by the logarithms of their 

assigned probabilities 22-24 

9-B The 1$5 digraphs composing 26% of the 6,000 digraphs of Table 6-A, accompanied by the loga- 
rithms of their assigned probabilities, arranged alphabetically according to their final letters (1) and 
according to their initial letters (2) and according to their absolute frequencies 25 
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Table No Jttge 

9-C The 63 digraphs composing 60% of the 6,000 digraphs of Table 6-A, accompanied by the loga- 
rithms of their assigned probabilities, arranged alphabetically according to their final letters (1) and 

according to their initial letters (2) and according to their absolute frequencies - - 26-26 

9-D The 122 digraphs composing 76% of the 6,000 digraphs of Table 6-A, accompanied by the loga- 
rithms of tbeir assigned probabilities, arranged alphabetically according to their final letters (1) 

and according to their initial letters (2) and accordmg to their absolute frequencies 26-28 

9-E All the 428 different digraphs of Table 6-A, arranged alphabetically first according to their final letters 

and then according to their initial letters 28 

10-A The 66 trigraphs appearing 100 or more times m the 60,000 letters of Governmental plam-text tele- 
grams, arranged accordmg to their absolute frequencies, accompanied by the logarithms of their 

assigned probabilities 28 

10-B The 66 trigraphs appearing 100 or more times in the 60,000 letters of Governmental plam-text tele- 
grams, arranged first alphabetically accordmg to their initial letters, and then accordmg to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities 29 

10-C The 66 trigraphs appearing 100 or more times m the 60,000 letters of Governmental plam-text tele- 
grams, arranged first alphabetically accordmg to their central letters, and then according to their 
absolute frequencies, accompanied by the logarithms of their assigned probabilities 29-30 

10- D The 66 trigraphs appearing 100 or more times in the 60,000 letters of Governmental plam-text tele- 

grams, arranged first alphabetically according to their final letters, and then accordmg to their abso- 
lute frequencies, accompanied by the logarithms of their assigned probabilities 80 

11- A The 64 tetragraphs appearing 60 or more times in the 60,000 letters of Governmental plain-text 

telegrams, arranged accordmg to their absolute frequencies, accompanied by the logarithms of 

their assigned probabilities . ... 81 

11-B The 64 tetragraphs appearing 60 or more times m the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically accordmg to tbeir initial letters, and then according to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities 81 

11-C The 64 tetragraphs appearing 60 or more times In the 60,000 letters of Governmental plam-text 
telegrams, arranged first alphabetically accordmg to their second letters, and then according to 

their absolute frequencies, actfompadied by the logarithms of their assigned probabilities 82 

11-D The 64 tetragraphs appearing 60 or more times in the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically accordmg to their third letters, and then accordmg to 

their absolute frequencies, accompanied by the logarithms of their assigned probabilities 32-88 

11-E The 64 tetragraphs appearing 50 or more times m the 60,000 letters of Governmental plain-text 
telegrams, arranged first alphabetically accordmg to their final letters, and then according to their 

absolute frequencies, accompanied by the logarithms of their assigned probabilities 88 

12 Average length of words and messages 84 

18 Checkerboard individual frequencies 86 

14 Relative logarithmic values of frequencies of English digraphs 86 

16 Relative logarithmic values (Logo 222) of frequencies of English digraphs ... 87 

***** 

SPECIAL-PURPOSE DATA 

l6-A. Frequency distribution of digraphs, based on 6k, 3&5 letters of 
decrypted U. S. Government messages in which Z was used as a 

word- separator and X was used for both Xp and Zp.. 38 

l 6 -B. Frequency distribution of digraphs, based on the text used for 
Table 16 -A, from which the Z word-separator has been omitted 

(total. 53,866 letters).. ...... ..... ••....•••. 39 

l 6 -C • The 53 digraphs from Table 6 -A which comprise 50$ of the to- 
tal, arranged according to frequencies reduced to a base of 
5,000 digraphs, shown with the corresponding frequencies of 
the same digraphs from Table l 6 -B (also reduced to a base 
of 5 , 000 ) 40 
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Tabus 1-A — Absolute frequencies of letters appeanng vnfive sets of Governmental plaintext telegrams, 
each set containing 10,000 letters, arranged alphabetically 



Sat No 1 


Set No 2 


Set No 8 


Set No 4 


SCt No. 5 


Loiter 


Absolute 

Frequency 


.Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


A 


788 




788 




681 


A. - 


740 


A-_ - _ 


741 


B . 


mm 


■jjjttMM 


103 




98 


B„ 


83 


R 


99 


0 


319 




800 




288 


c 


826 


n 


301 


n 


887 


wwmm 


418 


VWHM 


428 


D 


461 


n 


448 


E. 


1,367 


E. 


1,294 


E 


1,292 


E 


1,270 


E. 


1,276 


p 


253 


F 


287 


F 


308 


F. 


287 


p 


281 


G 


166 


G 


176 


G 


161 


G _ __ 


167 


R 


150 


H 


310 


H 


351 


H. 


335 


H. 


849 


H 


849 


I 


742 


I . _ 


760 


I 


787 


I 


700 


T 


697 


.T 


18 


J 


17 


J 


10 


n 


21 


J 


16 


K 


36 


K 


88 


K 


22 




21 


K 


81 


r. 


365 


L. 


393 


L 


333 




886 


I. 


844 


M _ 


242 


M .. _ ... _ 


240 


M 


238 


M 


249 


M. 


268 


N 


786 


N 


794 


N _ 


815 


M 


800 


N 


780 


0 


685 


0 _ 


770 


0 _ 


791 


O 


756 


n 


762 


P 


241 


P 


272 


P 


817 


P 


245 


p 


260 


Q 


40 


P 


22 


Q_ _ 


45 


Q 


38 


Q_ 


80 


R 


760 


R. 


746 


R_ 


762 


R. 


786 


R 


786 


J 

s 


658 


S 


583 


S 


585 


S 


628 


a 


604 


T 


986 


T 


879 


T 


894 


T 


958 


T 


928 


n 


270 


II 


233 


U _ 


812 


II 


247 


II 


238 


V 


163 


V 


178 


V __ 


142 


V 


133 


V 


156 


w 


166 


w 


163 


W 


136 


W 


188 


W _ 


182 


x ........ 


43 


XL 


50 


X 


44 


X 


58 


X 


41 


v ....... 


191 


Y 


165 


Y 


179 


Y 


213 


Y _ 


229 


z_ 


14 


7. 


17 


Z. 


2 


7 . 


11 


Z 


6 






















Total 


10,000 




10,000 




10,000 




10,000 




10,000 













l 
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Table 1-B — Absolute frequencies of letters appearing m five sets of Governmental plain-text telegrams, 
each set containing 10,000 letters, arranged according to frequency 



Set No 


1 


Set No Z 


Set No S 


Set No 4 


Bet No 5 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


Letter 


Absolute 

Frequency 


EL 


1,367 


EL 


1,294 


MU 


1,292 


E 


1,270 


E 


1,275 


T _ _ 


936 


RM 


879 


hM 


894 


T_ _ 


968 


T _ 


928 


N 


786 




794 


N _ _ 


816 


N 




R 


786 


R 


760 




783 




791 


0 


756 


N 


780 


I 


742 


oMi 


770 


i 


787 


A 


740 


0 


762 


A 


788 


11 


760 


R 


762 


R 


786 


A 


741 


0 


686 


! R 


745 


A. 


681 


I 


700 


T . 


697 


S. _ 


658 




583 


S 


585 


S 


628 


s 


604 


D. 


887 




413 


D 


423 


D _ _ _ 


461 


D 


448 


T. 


365 




393 


R 


335 


L 


886 


H. 


349 


c. 


319 


H. 


351 


L. 


333 


H. _ 


849 


L 


344 


H. 


310 


C 


300 


P 


817 


C 


326 


C 


301 


U_ 


270 


F 


287 


U 


312 


F 


287 


F 


281 


F- 


253 


P _ _ 


272 


F 


308 


NL _ 


249 


M 


268 


1YL 


242 


M. _ „ 


240 


C _ _ . 


288 


U 


247 


V: ■ 


260 


P 


241 


U 


238 


M. . 


238 


P_ 


245 




238 


y 


191 


G 


176 


Y 


179 


y 


213 




229 


G 


166 


V 


173 


G _ 


161 


G 


167 


w 


182 


w 


166 


W 


163 


V 


142 


V 


133 


V 


155 


v 


163 


y 


165 


W 


136 


w 


133 


G 


160 


B 


104 


B 


103 


B 


98 


B 


88 


B 


99 


X. 


43 


x. __ _ 


50 


Q. 


45 


X. 


53 




41 


q 


40 


KL 


38 


X. _ 


44 


Q_ 


38 


K. 


81 


K 


36 


E7 


22 


K 


22 


K. 


21 


1 


30 


j_ _ 


18 




17 


J 


10 


J 


21 


J 


16 


z. 


14 


Ipfeii 


17 


Z_ 


2 


Z. _ 


11 


Z.__ _ 


5 


TotaL- 


10,000 


IBB 


10,000 




10,000 


— 


10,000 




10,000 



Table 1-C — Absolute frequencies of vowels, high-frequency consonants, medium-frequency con- 
sonants, and low-frequency consonants appearing m five sets of Governmental plain-text tele- 
grams, each set containing 10,000 letters 
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Table 2-A — Absolute frequencies of letters appearing in the combined five sets of messages totalling 

50,000 letters, arranged alphabetically 



A ... 


3,683 


G 


819 


L 


1,821 


Q- - 


175 


V . . 


766 


B 


„487 


H 


1,694 


M 


. 3,237 


R _. 


3,788 


W . . 


780 


C .. 


1,534 


I .. 


3,676 


N 


3,976 


S 


3,068 


X . . 


231 


D 


2,122 


J... 


82 


0 


. 3,764 


T . 


4,696 


Y 


967 


E ... 
F . . 


6,498 

1,416 


K . 


148 


P 


1,336 


U. . 


1,300 


Z . . . 


49 



Table 2-B — Absolute frequencies of letters appearing m the combined five sets of messages totalling 

50,000 letters, arranged according to frequency 



E 


.. 6,498 


I. . .. 


3,676 


C 


. 1,634 


Y 


967 


X. . 


231 


T . 


.... 4,695 


S . .. 


3,058 


F 


1,416 


G 


819 


Q 


175 


N 


.. 3,975 


D 


2,122 


P 


1,335 


W. .. 


780 


K 


148 


R. 


... 3,788 


L... . 


1,821 


U 


.. 1,300 


V 


766 


J ... 


82 


O... 


.... 3,764 


H._ 


1,694 


M 


. 1,237 


B 


487 


Z. ... 


49 



A 3,683 



Table 2-C — Absolute frequencies of vowels, high-frequency consonants, medium-frequency con- 
sonants, and low-frequency consonants appearing m the combined five sets of messages totalling 



50,000 letters 

Vowels , 19,888 

High-frequency consonants (D, N, R, S, and T) 17,638 

Medium-frequency consonants (B, C, F, G, H, L, M, P, V, and W) 11,889 

Low-frequency consonants (J, K, Q, X, and Z) 685 



Total 50,000 



Table 2-D — Absolute frequencies of letters as initial letters of 10,000 words found in Governmental 

plain-text telegrams 

(1) ARRANGED ALPHABETICALLY 



A . 


905 


G 


. ... 109 


L 


196 


Q - 


30 


V 


77 


B. . 


287 


H 


272 


M . _ 


384 


R 


611 


W .. 


320 


C 


664 


I 


. _ 344 


N . . 


441 


S ... 


965 


X . 


4 


D 


525 


J 


44 


0. 


646 


T ... 


1,253 


Y .... 


88 


E . 


390 


K 


23 


P 


433 


U. .. 


122 


Z_ . 


12 


F _ . 


855 


































Total 


_10,000 








(2) ARRANGED ACCORDING TO 


FREQUENCY 






T 


1,253 


R 


611 


M 


384 


L 


196 


J. . 


44 


S . - 


965 


D. 


525 


I 


344 


U 


122 


Q .. . 


30 


A 


905 


N 


. .. 441 


W 


320 


G 


109 


K 


23 


F. 


855 


P 


. . 433 


B 


287 


Y - 


88 


Z 


12 


C _. 


664 


E 


390 


H 


272 


V 


77 


X 


4 


0 


646 


































Total 


10,000 
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Table 2-E — Absolute frequencies of letters as final letters of 10,000 words found m Governmental 

plain-text tclegi a, 11 s 



( 1 ) ARRANGED ALPJIABDTICAELY 



A 


269 


G. 


225 


lu 


354 


Q 


. , 8 


V 




4 


B 


22 


H 


. 450 


M 


15i 


R 




W 




45 


C 


86 


I 


22 


N 


872 


S 


' '962 


X 




116 


D „ 


- 1,002 


J. 


- , 6 


0 . , 


575 


T- 


1,007 


Y 


_ _ 


866 


E 


. 1,628 


K 


53 


P 


213 


U 


ai 


Z 


_ 


9 


F 


. 252 
















— 




















Total 


„10 : 


,000 








(2) ARRANGED ACCORDING TO 


FREQUENCY 








E 


1,628 


R 


769 


F 


252 


C 


86 


I 




22 


T . 


1,007 


0 


. 575 


G 


225 


K 


53 


Z 




9 


D . 


.. 1,002 


H 


450 


P 


213 


W 


45 


Q 




8 


S 


962 


L 


354 


M 


154 


u 


31 


J . 




6 


N .. 


. . 872 


A 


269 


X 


116 


B. 


22 


V . .. 




4 


Y . 


.. 866 




































Total . 


10 


,000 



Table 3 — Relative frequencies of letters appearing in 1,000 letters based upon Table 2-B 



(1) ARRANGED ALPHABETICALLY 



A 


73 


66 


G . _ 


16 


38 


L 


- 36 


42 


Q 


3 


50 


V 


„ 15 32 


B 


9 


74 


H . 


33 


88 


M 


24 


74 


R 


75 


76 


W 


. _ 15 60 


C _ . . 


30 


68 


I 


73 


52 


N 


79 


50 


S . . 


61 


16 


X 


. . 4 62 


D 


_42 


44 


J 


1 


64 


0 


75 


28 


T 


91 


90 


Y 


19 34 


E . . 


129 


96 


K 


2 


96 


P 


26 


70 


U 


26 


00 


Z 


&8 


F... . 


28 


32 






















1 














t 










Total 


1,000 90 








(2) ARRANGED ACCORDING 


TO 


FREQUENCY 






1 


E 


129 


96 


I 


73 


52 


C 


"30 


68 


Y 


19 


34 


X 


. . 4 62 


T . 


91 


90 


S . . 


61 


16 


F 


28 


32 


G 


16 


38 


Q 


. 3 50 


N 


79 


50 


D 


42 


44 


P 


26 


70 


W . 


15 


60 


K 


. 2 96 


R 


75 


76 


L . . 


36 


42 


U 


_ 26 


00 


V 


15 


32 


J 


1 64 


O. . . 


75 


28 


H 


33 


88 


M 


. 24 


74 


B 


9 


74 


Z 


98 


A ... 


73 


66 














































Total 


1,000 00 








(3) VOWELS 








(4) HIGH-FREQUENCY 








A. 








_ 73 


66 






CONSONANTS 










E 








129 


96 


D. 




_ 




42 


44 






I 








73 


52 


N 




— - 


r - 


79 


50 






0 








75' 28 


R 




- 


- 


75 


76 






U 








26 


00 


S 


- 


— 




61 


16 






Y 




— 




19 


34 


T 






— 


91 


90 










Total 




3.97 76 






Total 


350 


76 
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Tablh 3, Contd — Relative frequencies of letters appearing in 1,000 letters based upon Table 2-B 



(6) MEDIUM-FREQUENCY /g\ LOW-FREQUENCY 

CONSONANTS CONSONANTS 



B 


9 74 


X 


4.62 


C . . 


30 68 


Q - 


3 50 


F ... . 


28 32 


K . . 


2 96 


G 


16 38 


J. . 


1 64 


H . .. . _ 


33 88 


Z 


98 


L 


36 42 


— 




M 


24 74 


Total 


13 70 


P 


26 70 


— 




V 


15 32 






W 


16 60 






- 




‘ Total (3), (4), 




Total 


237 78 


(5), (6).. . 1, 


000 00 



Table 4 — Frequency distribution for 10,000 letters of literary English, as compiled by Hitt 1 

(1) ARRANGED ALPHABETICALLY 



A 


778 


G 


174 


L .. 


372 


Q 


8 


V .. . 


112 


B. 


141 


H 


595 


If 


288 


R 


651 


W . .. 


176 


C 


. .. 296 


I 


. . 667 


N . . 


686 


S 


622 


X 


27 


D 


402 


J 


. _ 51 


0 .. . 


807 


T 


855 


Y 


196 


E 


1,277 


K 


... 74 


P 


223 


U . . 


308 


Z ... . 


17 


F 

E 


. .. 197 

. . 1,277 


R 


(2) ARRANGED ACCORDING TO FREQUENCY 
651 U 308 Y_ . . . 196 


K. ... 


74 


T. 


. . . 855 


S 


622 


C.. _. 


296 


W ... 


176 


J 


51 


0. 


807 


H 


696 


M . _ 


288 


G 


174 


X . .. 


27 


A 


778 


D 


... 402 


P... ._ 


223 


B 


141 


z 


17 


N. 


. . . 686 


L. 


372 


F... . 


197 


V 


112 


Q- 


8 



1 667 



Table 6 — Frequency distribution for 10,000 letters of telegraphic English, as compiled by Hitt 1 

(1) ARRANGED ALPHABETICALLY 



A 


813 


G. 


. 201 


L 


392 


Q ..... 


38 


V. . . 


.. 136 


B .. . . 


149 


H . 


386 


M 


273 


R_ .._ 


677 


W . ... 


.. 166 


C. ... 


306 


I_. 


711 


N. .. 


718 


S.. .. 


656 


X ... 


51 


D 


417 


J.. 


42 


O 


844 


T 


634 


Y .. 


. 208 


E 


1,319 


K 


88 


P 


243 


U. 


321 


Z . .. 


6 


F 


205 
























(2) ARRANGED ACCORDING TO 


• FREQUENCY 






E .. . 


1,319 


S.. 


656 


u 


321 


F 


205 


K 


88 


O . 


844 


T. 


634 


c 


306 


G _ 


201 


X 


51 


A_. 


813 


D. 


417 


M 


273 


W 


166 


J . . . 


42 


N . .. 


718 


L_. 


. . . . 392 


P 


243 


B 


149 


Q. .. 


38 


I . . _ 


711 


H.. 


.. .. 386 


Y.. .. 


208 


V... . 


136 


Z ... 


6 


R. 


677 



















i Hitt, Capt Parker Manual for the Solution of Military Ciphers Army Service Schools Press, Fort 
Leavenworth, Kansas, 1916 
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Table 6-A — . Frequency distribution of digraphs, based on 50,000 letters of Governmental plawirtexl 

telegrams, reduced to 5,000 digraphs 



Second Letter 
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Table 6-B — Frequency distribution of digraphs (naval text), based on 20,000 letters of mvcd 

reduced to 2,000 digraphs'- 



A 

B 

c 

D 

E 

F 

G 

H 

I 

J 

K 

« L 
J H 
N 
0 
P 
Q 
R 
S 
T 
U 
V 
W 
X 

y 

z 

Total 

Blanks 



Em 

8 



- Second Letter 



Total 



A BC D EFGHIJKLM N OPQR S T UVWXYZ 



Blanks 



1 


4 


9 


6 




2 


3 


1 


8 




3 


7 


2 


29 




4 




16 


ii 


31 


1 


3 




1 


6 


| 


146 


m 


4 






1 


8 








1 






6 


2 




4 












1 








2 




29 


m 


7 




1 








2 


5 


1 




4 








22 


1 




4 


i 


4 












| 


62 


14 




2 


2 


2 


16 


3 


1 


1 


12 






2 


2 




4 


3 




8 


6 


6 


2 


1 


1 




3 


| 


86 


■E 


9 


3 


8 


24 


25 


7 


1 


2 


7 


1 


1 


6 


6 


34 


6 


E 


1 


43 


23 


18 


1 


7 


2 


4 


1 


E 


254 




2 




1 




2 


1 






13 






5 


1 




12 


1 




2 


1 


5 


1 








1 




48 


12 


4 




1 


1 


8 


1 


1 


11 


2 






2 




1 


2 


1 




2 


2 


6 


3 










i 


49 


9 


6 








7 


1 






6 








1 




3 


1 




7 


1 


11 


6 




1 








51 


14 


2 


1 


6 


2 


2 


5 


11 










8 


2 


42 


21 


2 






n 


11 




9 




6 






149 


9 






























2 
























2 


m 


1 


1 


1 




3 


1 






2 






1 




1 


























11 


ML 


14 


1 


1 




16 


1 






8 






6 






7 


2 




1 


2 


1 


1 


2 






2 




64 


11 


11 


1 






6 








4 






1 


2 




4 


2 






1 












3 




34 




E 


8 


8 


22 


22 


5 


22 


2 


6 




2 


2 


2 


3 


E 


2 




2 


9 


27 


3 




1 








163 


MK 


3 


3 


3 


11 


4 


9 


2 




6 




1 


4 


9 


38 


2 


8 




E 


9 


7 


E 


1 


4 


1 


1 


i 


167 


HE 


4 








18 






1 


1 






6 






7 


3 




8 


3 


2 


1 












53 












































3 












8 


■*T 

mmh 


14 


2 


6 


9 


34 


2 


8 




19 




1 


1 


3 


3 


24 


2 




2 


8 


IE 


4 




1 








148 


HE 


8 


2 


8 


1 


16 


2 




4 


13 






2 


1 


1 


6 


6 


1 


1 


6 


23 


6 




3 








108 


HE 


16 


1 


4 


8 


27 


4 


1 


21 


23 






3 


1 


2 


22 


3 




IE 


8 


8 


4 




12 




8 


4 


185 


IH 


4 


3 


1 


2 


8 




1 




4 






2 


2 


9 




1 




1 


4 
















47 


| 


8 








17 








4 












1 
























26 


22 


4 








IS 






1 


6 












6 






1 


















27 


20 






1 






1 




1 


4 






1 
















2 














10 


20 


3 


1 


2 


1 


2 


8 






1 






3 


2 




2 


2 




l 


2 


2 






1 








28 


11 










m 


































1 










11 


H! 

■M 


IB 


28 


63 


84 


262 


48 


48 


i 


EES 


1 


12 


67 


38 


163 


166 


54 


2 


139 


BS 


184 


57 


24 


26 


11 


26 


E 


1,960 


S 


Li 


12 


9 


13 


4 


EE 


16 


li 


4 


25 




7 


11 


16 


6 


8 


24 


8 


8 


8 


11 


19 


17 


22 


17 


22 


■ 


s 



l Fractional values have been discarded This accounts for the discrepancy between the indicated total (1,960) 
and the stated total (2,000) 
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Tables 7-11, Inclusive 

Absolute frequencies of digraphs, trigraphs, and tetragraphs and the logarithms of their 

assigned probabilities 1 

1 For each of the following 18 tables, the basic data were first arranged according to their 
absolute frequencies (F), and then the logarithms — L W (F) of the frequencies found 

2 The tables are designed to facilitate determination of the relative weights or probability 
of occurrence of ^ets of digraphs, tngraphs, or tetragraphs, particularly with respect to various 
"matching” operations For example, are the matched digraphs RE and ET more probable than 
the matched digraphs RT and EF ? Table 7-A shows the frequencies (F) of the digraphs to be 
as follows RE =98, ET=37, RT=42, EF=18 Therefore, 98 times 37 is compared with 42 
times 18, or 3,626 with 756 This arithmetic method of approach is extremely cumbersome for 
a large number of comparisons By using the logarithms of the individual frequencies, the opera- 
tion is greatly simplified, since the addition of the logarithms of two numbers is equivalent to 
the multiplication of their equivalent arithmetic values Thus, the foregoing computation may 
be expressed as Log 98+Log 37, compared with Log 42+Log 18, or 0 96+0 79 versus 0 81+0 66 
(see Table 7-A and explanation below) If more than one occurrence of a particular digraph is 
involved, it is merely necessary to multiply the logarithmic value by the number of the occur- 
rences, viz , Log X4 2(Log Y)+3(Log Z), as compared with Log A+3(Log B)+2(Log C) 

3 The logarithm of any given number is the power to which 10 must be raised to equal the 
given number Thus, 10 2 = 100, or the logarithm of 100=2 Similarly, 10 s =1,000, or the loga- 
rithm of 1,000 = 3 The sum of logai ithms is equal to the logarithm of the product of their antilogs 
(arithmetic numbers they represent) For example, 10 2 =100, 10*=1000, 10 2+8 =100X1000, Log 
100,000 = 5 Also, 10° = 1, or Log 1=0 The Log of 0 is minus infinity (-») 

4 In the compilation of the logarithms of the elements constituting these tables, frequencies 
of 1, of course, had a logarithmic value of 0 00 Digraphs which did not occur , 2 1 e., those with 
0 occurrences, had a logarithmic value of minus infinity (-«>) For practical use, each of 
the original frequency occurrences in these tables was doubled, i e , EN was given a frequency 
of 222 instead of 111, the frequency of RE became 196 instead of 98, etc Thus, single occur- 
rences were doubled (2x1 = 2), and the logarithms of those elements became 030 instead of 0 
This is equivalent to saying Log 1+Log 2=0 00+0 30=0 30 Those elements which occurred 
0 times, now were assumed to have an occurrence of 1, with an equivalent logarithmic value of 0 00 

5 In order to place all the logarithms of the initial frequencies on a comparable logarithmic 
basis, it was merely necessary to add 0 30 to each of them While EN had a frequency of 111 
m the original compilation, it now had a frequency of 222, or 2(111) The logarithm of 222 is 
2 35 This is equivalent to saying Log 111 +Log 2 = 2 05+0 30 = 2 35 

6 The frequencies as stated in terms of their actual logarithms do not readily indicate their 
relative size for each distribution Therefore, the highest frequency m each group was given a 
value of 0 99, and the lowest a value of 0, frequencies intermediate between these extremes were 

* These frequency distributions are based upon data derived from 60,000 letters o{ U S Governmental plain-text telegrams, reduced to B 000 
digraphs 

* "While in general it is possible to assign probability values to digraphs in accordance with their observed frequencies, it is not strictly correct 
to associate the probability "0 with a frequency of zero This would bo equivalent to saying Because a specified digraph has not occurred, it 
cannot occrn and would be reflected in the mathematics Log probability zero equals minus infinity ” What may be said fa 'Since a specified 
digraph hns not occurred in the data its true probability value is unknown except that it must be below the probability value assigned to a fre- 
quency of one ” The proper way to assign a probability value to digraphs with frequencies of zero is to continue counting until they have at 
least one occurrence then the true relative probability can be found 

A simple practical method of taking thiB difficulty into account fa merely to assume that in twice the amount of data the digraph probably 
would Imvc occurred at least once that fa it has a frequency of one-half 

It should be pointed out however, that since probabilities are multiplied (by summing logarithms) a 10% error in evaluating the digraph ZZ 
for example, makes the product, wherever ZZ occurs 10% wrong and is just as senous as a 10% error in evaluating the high frequency digraph EN 

In practice however results obtained from thn logarithmic method are so satisfactory that refinements are not needed 
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evaluated in proportion tp their respective frequencies This is equivalent to expressing the 
frequencies in logarithms with a base other than 10 In other words, this procedure of converting 
the logarithms to the range from 00 to 99 consists m dividing up the original range of logarithms 
t into 100 equal parts and assigning each one to the proper rank m the range 

7 The new base (C) used to convert each of the digraphic frequencies to the logarithmic 
range 0 to 0 99 is derived as follows, when 222 is the highest frequency (F) 

Let 222 =C° 88 
Logjo 222= Login C° 88 
Logio 222 = (0 99) (Log™ C) 

n Logio222 ... 2 35 

C = Antilog jj-gg Antilogy 

C=224 



8 The formula for the computation of the logarithm to the new base (C) of any actual 
frequency (Y) of a series is 



Log„ Y = 



Login Y 
Login U 



It is more expeditious to use reciprocals m the conversion of a whole series of logarithmic values, 
as m this instance The formula is (Login C)" 1 (Login Y) = Log 0 Y 

9 The digraphic index chart, Table 15, on page 37, summarizes the logarithmic frequencies 
of all English plain-text digraphs, computed to a base of 224 so that the logarithm of the highest 
frequency (EN) is 0 99 

Example EN=222 

Logm 222=2 35 

(Login C)- l = (Login 224)-»=0 421 
Log. 222 = 0 421 X2 35 = 0 99 

10* Likewise, the trigraphs and tetragraphs have been computed to the 
"bases L 586 and L 2 h 4 , respectively, so that the logarithms of the highest- 
frequency trigraph (EOT) and tetragraph (TI0N) are 0.99* Since no use is 
"being made of the trigraphs appearing less than 100 times and tetragraphs 
appearing less than 50 times, the basic frequencies of the trigraphs and 
tetragraphs have not been doubled in computing the new bases of the loga- 
rithms . 
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Table 7-A — The i%8 different digraphs of Table 6-A, arranged according to their absolute frequencies, 
accompanied by the logarithms of their assigned probabilities 



F 


L»(r) 


mgrrm 

■nai 


F 


L»(F) 




F 


L»(F) 


H'i/B 


F 


Lw(F) 


JLau 

<2F) 


EN.. 


Ill 


2 


05 


99 


DA.. 


32 


1 


51 


76 


OL . 


19 


1 


28 


67 




12 


1 


08 


58 


RE .. 


98 


1 


99 


96 


EC.. 


32 


1 


51 


76 


0T_. 


19 


1 


28 


67 




12 


1 


08 


58 


ER.. 


87 


1 


94 


94 


RS . 


31 


1 


49 


75 


SS 


19 


1 


28 


67 


SF... 


12 


1 


08 


58 


NT.. . 


82 


1 


91 


93 


UR. _ 


31 


1 


49 


75 


TS. . 


19 


1 


28 


67 


US.... 


m 


1 


08 


58 


TH. . 


78 


1 


89 


92 


HI. . 


30 


1 


48 


75 


TT . 


19 


1 


28 


67 


UT .. 


12 


1 


08 


58 


ON .. 


77 


1 


89 


92 


RI. 


30 


1 


48 


75 


wo .. 


19 


1 


28 


67 


VI.. 


12 


1 


08 


58 


IN ... 


75 


1 


88 


92 


EL. 


29 


1 


46 


74 


BE . 


18 


1 


26 


66 


WA._ 


12 


1 


08 


58 


TE. . 


71 


1 


85 


91 


HT.. 


28 


1 


45 


74 


EF... 


18 


1 


26 


66 


FF_... 


11 


1 


04 


56 


AN. _ 


64 


1 


81 


89 


LA.. 


28 


1 


45 


74 


NO . 


18 


1 


26 


66 


FT.... 


11 


1 


04 


56 


OR .. 


64 


1 


81 


89 


R0.. 


28 


1 


45 


74 


PR. .. 


18 


1 


26 


66 


PP.. 


11 


1 


04 


56 


ST.. 


63 


1 


80 


88 


TA . 


28 


1 


45 


74 


AI 


17 


1 


23 


64 


RR.. 


11 


1 


04 


56 


ED. _ 


60 


1 


78 


88 


*2.495 








HR... 


17 


1 


23 


64 


SU. 


11 


1 


04 


66 


NE... 


57 


1 


76 


87 












F0.. 


17 


1 


23 


64 


UE__ 


11 


1 


04 


56 


VE... 


67 


1 


76 


87 


AD . 


27 


1 


43 


73 


RD__ 


17 


1 


23 


64 


YF . 


11 


1 


04 


56 


ES ... 


54 


1 


73 


86 


DI _ 


27 


1 


43 


73 


TR._ 


17 


1 


23 


64 


YS... 


11 


1 


04 


56 


ND.... 


62 


1 


72 


86 


El . 


27 


1 


43 


73 


DO .. 


16 


1 


20 


63 


FE.. 








55 


TO 


60 


1 


70 


84 


IR.. 


27 


1 


43 


73 


DT.. 


16 


1 


18 


62 


IF.. 








55 


SE. . 


49 


1 


69 


84 


IT . 


27 


1 


43 


73 


IX . 


15 


1 


18 


62 


LY. . 








55 


1 1,249 








LL_. 


27 


1 


43 


73 


QU . 


15 


1 


18 


62 


MO„ . 








55 












NG . 


27 


1 


43 


73 


SO .. 


15 


1 


18 


62 


SP ... 








55 


AT... 


47 


1 


67 


83 


ME 


26 


1 


41 


72 


YT. 


15 


1 


18 


62 










55 


TI_ . 


45 


1 


65 


82 


NA. . 


26 


1 


41 


72 


AC.. 


14 


1 


15 


61 


FR. _ 


9 


0 


95 


58 


AR . 


44 


1 


64 


82 


SH 


26 


1 


41 


72 


AM 


14 


1 


15 


61 


IM.. 


9 


0 


95 


53 


EE... 


42 


1 


62 


81 


IV.. 


25 


1 


40 


72 


CH . 


14 


1 


15 


61 


LD_. 


9 


0 


95 


53 


RT _ 


42 


1 


62 


81 


OF.. 


25 


1 


40 


72 


CT.. 


14 


1 


15 


61 


MI.. 


9 


0 


95 


63 


AS... 


41 


1 


61 


80 


0M 


25 


1 


40 


72 


EM .. 


14 


1 


15 


61 


NF... 


9 


0 


95 


53 


CO... 


41 


1 


61 


80 


OP 


26 


1 


40 


72 


GE . 


14 


1 


15 


61 


RC... 


9 


0 


95 


53 


10... 


41 


1 


61 


80 


NS. . 


24 


1 


38 


71 


OS . 


14 


1 


15 


61 


RM._ 


9 


0 


95 


53 


TY.... 


41 


1 


61 


80 


SA 


24 


1 


38 


71 


PA .. 


14 


1 


15 


61 


RY_. 


9 


0 


95 


53 


F0.„. 


40 


1 


60 


80 


IL . 


23 


1 


36 


70 


AU . 


13 


1 


11 


59 


YE_ . 


9 


0 


95 


53 


FI.... 


39 


1 


59 


80 


PE. 


23 


1 


36 


70 


DS 


13 


1 


11 


59 


DD 


8 


0 


nil 


51 


RA . 


39 


1 


59 


80 


IC .. 


22 


1 


34 


69 


IE 


13 


1 


11 


59 


DF... 


8 


0 


90 


51 


ET ._ 


37 


1 


67 


79 


WE.. 


22 


1 


34 


69 


LO . 


13 


1 


11 


59 


HU _ 


8 


0 


nn 


51 


LE 


37 


1 


57 


79 


UN... 


21 


1 


32 


68 


MM. 


13 


1 


11 


59 


IA._ 


8 


0 


90 


51 


OU.. 


37 


1 


57 


79 


CA. 


20 


1 


30 


67 


PL.. 


13 


1 


11 


59 


LT... 


8 


0 




61 


MA.. 


36 


1 


56 


78 


EP 


20 


1 


30 


67 


RP_ . 


13 


1 


11 


69 


MP... 


8 


0 




61 


TW.. 


36 


1 


56 


78 


EV 


20 


1 


30 


67 


SC.... 


13 


1 


11 


59 


NN... 


8 


0 




51 


EA... 


35 


1 


54 


78 


GH... 


20 


1 


30 


67 


WI .. 


13 


1 


11 


59 


OC ... 


8 


0 




51 


IS ... 


35 


1 


54 


78 


HA... 


20 


1 


30 


67 


’3.745 








0W_. 


8 


0 


90 


51 


SI.. 


34 


1 


53 


77 


HE 


20 


1 


30 


67 












PT. . 


8 


0 




51 


DE .. 


33 


1 


52 


77 


HO... 


20 


1 


30 


67 


AP . 


12 


1 


08 


58 


UG.. 


8 


0 




51 


HI _ 


33 


1 


52 


77 


LI 


20 


1 


30 


67 


AY . 


12 


1 


08 


58 




7 


0 


85 


48 


AL . 


32 


1 


51 


76 


IG . 


19 


1 


28 


67 


DR 


12 


1 


08 


58 


BY 


7 


0 


85 


48 


CE. 


32 


1 


51 


76 


NC . 


19 


1 


28 


67 


EO . 


12 


1 


08 


58 


Cl.. 


7 


0 


85 


48 



* The 18 digraphs abou* thu line compose -o c o of tl c total * The 122 digraphs abo\e this line compo e 75% of the total 

a Tht 53 digraphs above this lino compose 60^ of the total 
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Table 7-A, Contd — The h28 different digraphs of Table 6-A, 
absolute frequencies, accompanied by the logarithms of their 



3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
3 0 48 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 
2 0 30 



arranged according to their 




F 


Lu(F) 


RU.. 


5 


0 


70 


RV . 


5 


0 


70 


SD.._ 


6 


0 


70 


SR . 


5 


0 


70 


TL_. 


5 


0 


70 


TO.. 


6 


0 


70 


UA . 


5 


0 


70 


UI 


5 


0 


70 


UM . 


5 


0 


70 


AF 


4 


0 


60 


BA . 


4 


0 


60 


BO 


4 


0 


60 


CK . 


4 


0 


60 


CR . 


4 


0 


60 


CU.. 


4 


0 


60 


DB . 


4 


0 


60 


DC . 


4 


0 


60 


DN . 


4 


0 


60 


DW... 


4 


0 


60 


EB . 


4 


0 


60 


EG... 


4 


0 


60 


EY. _ 


4 


0 


60 


GT . 


4 


0 


60 


HS . 


4 


0 


60 


MS . 


4 


0 


60 


NH.. 


4 


0 


60 


NR.. 


4 


0 


60 


OB .. 


4 


0 


60 


PM _ 


4 


0 


60 


RW .. 


4 


0 


60 


SN . 


4 


0 


60 


SW .. 


4 


0 


60 


WH ._ 


4 


0 


60 


YC .. 


4 


0 


60 


YD. . 


4 


0 


60 


YR . 


4 


0 


60 


AA . 


3 


0 


48 


AW. . 


3 


0 


48 


cc . 


3 


0 


48 


DL .. 


3 


0 


48 


DV . 


3 


0 


48 


EU . 


3 


0 


48 


FS . 


3 


0 


48 


FU 


3 


0 


48 


GN . 


3 


0 


48 




F Lii(F> . (20 



2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 26 

2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 
2 0 30 25 

2 0 30 25 

1 0 00 13 

0 00 13 

0 00 13 

0 00 13 
0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 

0 00 13 
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Table 7-A, Concluded — The 1$8 different digraphs of Table 6-A, arranged according to their 
absolute frequencies, accompanied by the logarithms of their assigned probabilities 



F 


L»(T) 


Ll24 

(zr) 




Lu(F) 


mesm 

pa 


F 


Ln(F) 




F 


U(F) 


L*» 

(2F) 


CY. . 


1 


0 


00 


13 


HW 


1 


0 


00 


13 


PD . 


1 


0 


00 






■ 


0 


00 


13 


DJ _ 


1 


0 


00 


13 


HY... 


1 


0 


00 


13 


PN . 


1 


0 


00 








0 


00 


13 


DY _ 


1 


0 


00 


13 


JA . 


1 


0 


00 


13 


PV 


1 


0 


00 


13 




k 


0 


00 


13 


EJ . 


1 


0 


00 


13 


KA... 


1 


0 


00 


13 


PW .. 


1 


0 


00 


13 




m 


0 


00 


13 


EZ ... 


1 


0 


00 


13 


KC.. 


1 


0 


00 


13 


PY _ 


1 


0 


00 


13 




» 


0 


00 


13 


FD. _ 


1 


0 


00 


13 


KL__ 


1 


0 


00 


13 


QM 


1 


0 


00 


13 


XE. 


1 


0 


00 


13 


FG . 


1 


0 


00 


13 


KN._ 


1 


o 


00 


13 


QR _ 


1 


0 


00 


13 


X 

t 

i 

i 


1 


0 


00 


13 


FM.. 


1 


0 


00 


13 


KS . 


1 


0 


00 


13 


RJ . 


1 


0 


00 


13 


XH-. 


l 


0 


00 


13 


FP . . 


1 


0 


00 


13 


LG .. 


1 


0 


00 


13 


RK 


1 


0 


00 


13 


XN . 


l 


0 


00 


13 


FW-- 


1 


0 


00 


13 


LH... 


1 


0 


00 


13 


SK . 


1 


0 


00 


13 


XO.._. 


l 


0 


00 


13 


FY.. 


1 


0 


00 


13 


LN . 


1 


0 


00 


13 


SV . 


1 


0 


00 


13 


XR .. 


l 


0 


00 


13 


GD... 


1 


0 


00 


13 


MD._ 


1 


0 


00 


13 


SY „ 


1 


0 


00 


13 


XS _ 


l 


0 


00 


13 


GG... 


1 


0 


00 


13 


MF . 


1 


0 


00 


13 


TG.. 


1 


0 


00 


13 


YG ... 


l 


0 


00 


13 


GJ ._ 


1 


0 


00 


13 


MH .. 


1 


0 


00 


13 


TQ . 


1 


0 


00 


13 


YH.. 


l 


0 


00 


13 


GM . . 


1 


0 


00 


13 


NJ... 


1 


0 


00 


13 


TZ 


1 


0 


00 


13 


YU ._ 


l 


0 


00 


13 


GW 


1 


0 


00 


13 


NQ.. 


1 


0 


00 


13 


UF. 


1 


0 


00 


13 


YW_ . 


l 


0 


00 


13 


HB.. 


1 


0 


00 


13 


OJ.. 


1 


0 


00 


13 


UO . 


1 


0 


00 


13 


ZA . 


1 


0 


00 


13 


HL.. 


1 


0 


00 


13 


OX 


1 


0 


00 


13 


UV . 


1 


0 


00 


13 


ZI .. 


l 


0 


00 


.13 


HP.. 


1 


0 


00 


13 


PB.. 


1 


0 


00 


13 


VO . 


1 


0 


00 


13 


5.000 








HQ .. 


1 


0 


00 




PC . 


1 


0 


00 


13 


L VT 


1 


0 


00 


13 




_ 









Table 7-B — The 18 digraphs composing 25% of the digraphs m Table 6-A, accompanied by the 
logarithms of their assigned probabilities, arranged alphabetically according to their initial letters 



(1) AND ACCORDING TO THEIR FINAL (2) AND ACCORDING TO THEIR ABSOLUTE 

LETTERS FREQUENCIES 



F 


Lio(F) 


wm 


' 


Lm(T) 


"Ta" 

U*> 


r 


L»CF) 


<2F) 


F 


LiofP) 


L»324 

<2F) 


AN. 


. 64 


1 


81 


89 


ON. 


.. 77 


□ 


89 


92 


AN 


. 64 


1 


81 


89 


ON.. 


77 


l 


89 


92 












OR. 


. 64 


Q 


81 


89 












OR .. 


64 


1 


81 


.89 


ED. 


.. 60 


1 


78 


88 


RE. 


i 

CD 

00 


1 


99 


96 


EN. 


..111 


2 


05 


99 


RE 


98 


l 


99 


96 


EN 


..111 


2 


05 


99 












ER. 


. 87 


1 


94 


94 












ER 


.. 87 


1 


94 


94 


SE 


. 49 


l 


69 


84 


ED 


_ 60 


1 


78 


88 


ST . 


63 


l 


80 


88 


ES 


_ 64 


1 


73 


86 


ST. 


_ 63 


l 


80 


88 


ES. 


. 54 


1 


73 


86 


SE _ 


49 


l 


69 


84 












TE. 


. 71 


l 


85 


91 












TH 


78 


l 


89 


92 


IN 


... 75 


1 


88 


92 


TH. 


. 78 


l 


89 


92 


IN 


.. 75 


1 


88 


92 


TE.. 


71 


l 


85 


91 












TO. 


50 


1 


70 


84 












TO.... 


50 


l 


70 


84 


ND 


. 52 


1 


72 


85 


VE. 


57 


l 


76 


87 


NT.. 


. 82 


1 


91 


VI 


VE.. 


57 


l 


76 


87 


NE 


57 


1 


76 


87 


1 


,249 








NE 


. 57 


1 


76 


87 


1, 


249 








NT 


... 82 


1 


91 


93 


l 










ND. 


52 


1 


71 


85 
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Tablb 7-C — The 58 digraphs composing 50% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their initial letters 



(1) AND ACCORDING TO THEIR FINAL (2) AND ACCORDING TO THEIR ABSOLUTE 

LETTERS FREQUENCIES 



F 


Lm(F) 


mum 


F 


L»<n 


Kin 


r 


L»<r> 


'msm 


r 


L|.(D 


Lou 

m 


AL.. 


32 


1 


61 


76 


MA . 


36 


1 


56 


78 


AN.. 


64 




81 


89 


MA... 


36 


1 


56 


78 


AN... 


64 


1 


81 


89 












AT . 


47 




67 


83 












AR.. 


44 


1 


64 


82 


ND. 


52 


1 


72 


85 


AR . 


44 




64 


82 


NT 


82 


1 


91 


93 


AS. 


41 


1. 


61 


80 


NE . 


57 


1 


76 


87 


AS „ 


41 




61 


80 


NE.. 


57 


1 


76 


87 


AT... 


47 


1 


67 


83 


NI . 


30 


1 


48 


75 


AL . 


32 


1 


51 


76 


ND.... 


52 


1 


72 


85 












NT. 


82 


1 


91 


93 












NI.... 


30 


1 


48 


75 


CE ... 


32 


1 


61 


76 












CO 


41 


1 


61 


80 












CO__ 


41 


1 


61 


80 


ON.. 


77 


1 


89 


92 


CE . 


32 


1 


51 


76 


ON .. 


77 


1 


89 


92 












OR . 


64 


1 


81 


89 












OR . 


64 


1 


81 


89 


DA... 


32 


1 


61 


76 


OU . 


37 


1 


57 


79 


DE.. 


33 


1 


52 


77 


OU_. 


37 


1 


57 


79 


DE._ 


33 


1 


52 


77 












DA.. 


32 


1 


51 


76 






















RA„ 


39 


1 


59 


80 












RE.... 


98 


1 


99 


96 


EA .. 


36 


1 


54 


78 


RE . 


98 


1 


99 


96 


EN 111 


2 


05 


99 


RT.. 


42 


1 


62 


81 


EC . 


32 


1 


51 


76 


RI-. 


30 


1 


48 


75 


ER 


87 


1 


94 


94 


RA... 


39 


1 


59 


.80 


ED__ 


60 


1 


78 


88 


RO 


28 


1 


46 


74 


ED_ . 


60 


1 


78 


88 


RS . 


31 


1 


49 


76 


EE 


42 


1 


62 


81 


RS... 


31 


1 


49 


75 


ES .. 


54 


1 


73 


86 


RI-. 


30 


1 


48 


75 


EL.. 


29 


1 


46 


74 


RT.. 


42 


1 


62 


81 


EE „ 


42 


1 


62 


81 


RO.. 


28 


1 


45 


74 


EN ._ 


Lll 


2 


05 


99 












ET. _ 


37 


1 


57 


79 












ER.. 


87 


1 


94 


94 


SE.. 


49 


1 


69 


84 


EA . 


35 


1 


54 


78 


ST ... 


63 


1 


80 


88 


ES .. 


54 


1 


73 


86 


SI ... 


34 


1 


53 


77 


EC. . 


32 


1 


51 


76 


SE . 


49 


1 


69 


84 


ET 


37 


1 


57 


79 


ST . 


63 


1 


80 


88 


EL. . 


29 


1 


46 


74 


SI ... 


34 


1 


53 


77 


FI._ 


39 


1 


59 


80 


TA . 


28 


1 


45 


74 


FO 


40 


1 


60 


80 


TH... 


78 


1 


89 


92 


FO 


40 


1 


60 


.80 


TE.. 


71 


1 


85 


91 


FI 


39 


1 


59 


80 


TE_ . 


71 


1 


85 


91 












TH. . 


78 


1 


89 


92 












TO 


50 


1 


70 


84 


HI... 


33 


1 


52 


77 


TI.. 


45 


1 


65 


82 


HI _ 


33 


1 


52 


77 


TI.. 


45 


1 


65 


82 


HT_ . 


28 


1 


46 


74 


TO ._ 


60 


1 


70 


84 


HT. _ 


28 


1 


45 


74 


TY __ 


41 


1 


61 


80 












TW.. 


36 


1 


56 


78 












TW ... 


36 


1 


56 


78 


IN . 


76 


1 


88 


92 


TY . 


41 


1 


61 


80 


IN _ 


75 


1 


88 


92 


TA. 


28 


1 


45 


74 


10 . 


41 


1 


61 


80 












10 .. 


41 


1 


61 


80 












IS . 


35 


1 


54 


78 


UR .. 


31 


1 


49 


75 


IS.. 


35 


1 


54 


78 


UR.. 


31 


1 


49 


75 


LA... 


28 


1 


45 


74 


VE. . 


57 


1 


76 


87 


LE. . 


37 


1 


57 


79 


VE _ 


67 


1 


76 


87 


LE.. 


37 


1_ 


57 


79 


2,495 








LA 


28 


1_ 


45 


74 


2,495 
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Tabic 7-D — The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, accompanied 
by the logarithms of their assigned probabilities, arranged alphabetically according to their 
initial letters 



(1) AND ACCORDING TO THEIR PINAL LETTERS 
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Table 7-D, Concluded — The digraphs composing 7f$% of the ~5$$b 'digraphs of Table *6- A, 

accompanied by the logarithms of their assigned probabilities, arranged alphabetically according 
to their initial letters 



(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 




Table 7-E — All the 428 digraphs of Table 6-A, arranged first alphabetically according to their 
initial hlteis and then alphabetically according to then final letters 

(SEE TABLE 6-A— READ ACROSS THE ROWS) 
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Table 8 —The I&8 different digraphs of Table 6-A, arranged first alphabetically according to their 
initial letters and then according to their absolute frequencies under each initial letter accompanied 
by the logarithms of their assigned probabilities 



F 


L»(D 


Xi*M 

(2D 


r 


i»<r) 


K&l 


r 


Lio(F) 


JL>ni 1 

(SF) : 


F 


1 Liotrj | 


(2D 


AN . 


64 


1 


81 


89 


CT . 


14 


n 


15 


61 


ED 




GO 


1 


78 


88 


GH.. 


20 


1 


m 


67 


AT.... 


47 


1 


67 


83 


Cl . 


7 




85 


48 


ES 


„ 


54 


l 


73 


86 


GE. 


14 


1 


15 


61 


AR... 


44 


1 


64 


82 


CL.. 


5 




70 


42 


EE. 




42 


1 


62 


81 


GA . 


7 


□ 


85 


48 


AS.... 


41 


1 


61 


80 


CK . 


4 


0 


60 


38 


ET 


. 


37 


1 


57 


79 


GO. . 


6 


0 


78 


45 


AL . 


32 


1 


61 


76 


CR.. 


4 


0 


60 


38 


EA 


... 


35 


1 


54 


78 


GI .. 


5 


0 


70 


42 


AD. . 


27 


1 


43 


73 


CU . 


4 


0 


60 


38 


EC 




32 


1 


51 


76 


GR .. 


6 


0 


70 


42 


AI 


17 


1 


23 


64 


CC .. 


3 


0 


48 


33 


EL 




29 


1 


46 


74 


GT 


4 


0 


60 


38 


AC.. 


14 


1 


15 


61 


CD... 


1 


0 


00 


13 


El 




27 


l 


43 


73 


GN._ 


3 


0 


48 


33 


AM... 


14 


1 


15 


61 


CF... 


1 


0 


00 


13 


EP 


... 


20 


1 


30 


67 


GS... 


3 


0 


48 


33 


AU . 


13 


1 


11 


59 


CM . 


1 


0 


00 


13 


EV 




20 


l 


30 


67 


GC ... 


2 


0 


30 


25 


AP.... 


12 


1 


08 


58 


CN .. 


1 


0 


00 


13 


EF. 




18 


l 


26 


66 


GF.. 


2 


0 


30 


25 


AY . 


12 


1 


08 


58 


CS.. 


1 


0 


00 


13 


EM 


. 


14 


l 


15 


61 


GL._ 


2 


0 


30 


25 


AV.. 


7 


0 


85 


48 


cw . 


1 


0 


00 


13 


EO. 




12 


1 


08 


58 


GP_. 


2 


0 


30 


25 


AB 


6 


0 


78 


45 


CY.. 


1 


0 


00 


13 


EQ 


_ 


12 


l 


08 


58 


GU... 


2 


0 


30 


25 


AG .. 


6 


0 


78 


45 












EH 




7 


0 


85 


48 


GD 


1 


0 


00 


13 


AF... 


4 


0 


60 


38 


DE 


33 


1 


52 


77 


EW 




7 


0 


85 


48 


GG . 


1 


0 


00 


13 


AA _ 


3 


0 


48 


S3 


DA . 


32 


1 


51 


76 


EX 




7 


0 


85 


48 


GJ ... 


1 


0 


00 


13 


AW.... 


3 


0 


48 


33 


DI 


27 


1 


43 


i 73 


EB 


_ 


4 


0 


60 


38 


GM... 


1 


0 


00 


13 


AH _ 


2 


0 


30 


26 


DO... 


16 


1 


20 


63 


EG 




4 


0 


60 


38 


GW ... 


1 


0 


00 


13 


AK.. 


2 


0 


30 


25 


DT . 


15 


1 


18 


62 


EY 


_ 


4 


0 


60 


38 












AO .. 


2 


0 


30 


25 


DS 


13 


1 


11 


59 


EU 




3 


0 


48 


33 












AE .. 


1 


0 


00 


13 


DR . 


12 


1 


08 


58 


EJ 


... 


1 


0 


00 


13 












AJ_ . 


1 


0 


00 


13 


DD.. 


8 


0 


90 


1 51 


EZ 





1 


0 


00 


13 


HI. . 


33 


1 


52 


77 












DF. . 


8 


0 


90 


' 51 














HT. . 


28 


1 


45 


74 


BE 


18 


1 


26 


66 


DM . 


5 


iO 


70 


42 


FO. 




40 


1 


60 


80 


HA 


20 


1 


30 


67 


BY... 


7 


0 


85 


48 


DP . 


5 


0 


70 


42 


FI 




39 


1 


59 


80 


HE... 


20 


1 


30 


67 


BL... 


6 


0 


78 


45 


DU.. 


5 


0 


70 


42 


FF 


. 


11 


1 


04 


56 


HO... 


20 


1 


30 


67 


BA 


4 


0 


60 


38 


DB 


4 


0 


60 


38 


FT 




11 


I 1 


04 


56 


HR . 


17 


1 


23 


64 


BO ... 


4 


0 


60 


38 


DC . 


4 


0 


60 


38 


FE 


.. 


10 


T 


00 


65 


HU.. 


8 


0 


90 


51 


BI. . 


2 


0 


30 


25 


DN . 


4 


0 


60 


38 


FR 




9 


0 


95 


53 


HF .. 


5 


0 


70 


42 


BR... 


2 


0 


30 


25 


DW„ . 


4 


0 


60 


38 


FA. 





5 


0 


70 


42 


HS_. 


4 


0 


60 


38 


BU 


2 


0 


30 


25 


DL.. 


3 


0 


48 


33 


FS 




3 


0 


48 


33 


HC__ 


3 


0 


48 


33 


BJ. . 


1 


0 


00 


13 


DV.. 


3 


0 


48 


*53 


FU 


_ 


3 


0 


48 


33 


HN . 


3 


0 


48' 


33 


BM .. 


1 


0 


00 


13 


DG . 


2 


0 


30 


25 


FC 


... 


2 


0 


30 1 


25 i 


HD... 


2 


0 


30 1 


25 


BS .. 


1 


0 


00 


13 


DH 


2 


0 


30 


25 


FL 




2 


0 


30 


25 


HM .. 


2 


0 


30 


25 


BT. 


1 


0 


00 


13 


DQ. .. 


2 


0 


30 


25 


FD 


... 


1 


0 


00 1 


13 


HB._ 


1 


0 


00 


13 












DJ.. 


Jl 


0 


00 


13 


FG 


. 


1 


0 


oo ! 


13 


HL . 


1 


0 


00 


13 


CO . 


41 


1 


61 


80 


DY.. 


1 


0 


00 


13 


FM 




1 


0 


00 


13 


HP .. 


1 


0 


00 


13 


CE 


32 


1 


51 


76 










i 


FP 


_ _ 


1 


0 


00 


13 


HQ . 


1 


0 


00 


13 


CA 


20 


1 


30 


67 


EN. Ill 


2 


05 


99 


FW 




1 


0 


00 


13 


HW.. 


1 


0 


00 


13 


CH _ 


14 


1 


15 


61 


ER. . 


87 


1 


94 


94 

l 


FY 


- 


1 


0 


00 


13 

i 


HY. . 


1 


0 


00 


13 



1 For arrangement alphabetically first under initial letters and then under final letters, see Table 6-A 
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Table 8, Contd — The £28 different digraphs of Table "S-A, arranged first alphabetically according 
to their initial letters and then according to their absolute frequencies under each initial letter , 1 
accompanied by the logarithms of their assigned probabilities 




1 For ainngement alphabetically first under initial letters and then under final letters, see Table 6-A 
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Table 8, Concluded — The J$8 different digraphs of Table 6~A, arranged first alphabetically accord- 
ing to their initial letters and then according to their absolute frequencies under each initial letter, 1 
accompanied by the logarithms of their assigned probabilities 



F 


Lio(r) 


m i 


■■O 


LioCF) 


wm 


F 


Ltii(F) 




F 


L»(F) 


Ll24 

(2F) 


HE... 


98 


1 


99 


96 


SR.. 


5 




70 


42 


US .. 


12 


1 




58 


XI ... 


2 


0 


30 


25 


RT .. 


42 


1 


62 


81 


SN . 


4 




60 


38 


UT. .. 


12 


1 




58 


XP _ 


2 


0 


30 


25 


RA .. 


39 


1 


59 


80 


SW .. 


4 




60 


38 


UE_. 


11 


1 




56 


XD 


1 


0 


00 


13 


RS 


31 


1 


49 


75 


SB... 


3 




48 


33 


UG.. 


8 






51 


XE... 


1 


0 


00 


13 


RI . 


30 


1 


48 


75 


SM.. 


3 


0 


48 


33 


UL. . 


6 




78 


45 


XF.... 


1 


0 


00 


13 


R0-. 


28 


1 


45 


74 


SG .. 


2 


0 


30 


25 


UA.. 


5 


0 


70 


42 


XH... 


1 


0 


00 


13 


RD... 


17 


1 


23 


64 


SL .. 


2 


0 


30 


25 


UI__ 


5 


0 


70 


42 


XN.. 


1 


0 


00 


13 


RP.. 


.13 


1 


11 


69 


SK.. 


1 


0 


00 


13 


UM . 


5 


0 


70 


42 


XO.. 


1 


0 


00 


13 


RR.. 


11 


1 


04 


56 


SV.... 


1 


0 


00 


13 


UB.. 


3 


0 


48 


33 


XR. 


1 


0 


00 


13 


RC.. 


9 


0 


95 


53 


SY.. 


1 


0 


00 


13 


UC ... 


3 


0 


48 


33 


XS.. 


1 


0 


00 


13 


RM... 


9 


0 


95 


53 












UD. _ 


3 


0 


48 


33 












RY 


9 


0 


95 


63 


TH . 


78 


1 


89 


92 


UP.... 


2 


0 


30 


25 


YT . 


15 


1 


18 


62 


RG .. 


7 


0 


85 


48 


TE.. 


71 


1 


85 


91 


UF 


1 


0 


00 


13 


YF 


11 


1 


04 


56 


RN . 


7 


0 


85 


48 


TO. . 


60 


1 


70 


84 


UO... 


1 


0 


00 


13 


YS. _ 


11 


1 


04 


66 


RF . 


6 


0 


78 


45 


TI.. 


45 


1 


65 


82 


UV . 


1 


0 


00 


13 


YO ... 


10 


1 


00 


55 


RL... 


5 


0 


70 


42 


TY . 


41 


1 


61 


80 












YE... 


9 


0 


95 


53 


RU... 


5 


0 


70 


42 


TW... 


36 


1 


56 


78 


VE . 


57 


1 


76 


87 


YA_. 


6 


0 


78 


45 


RV . 


5 


0 


70 


42 


TA. 


28 


1 


45 


.74 


VI .. 


12 


1 


08 


58 


YN... 


6 


0 


78 


45 


RW.. 


4 


0 


60 


38 


TS .. 


19 


1 


28 


67 


VA . 


6 


0 


78 


45 


YC... 


4 


0 


60 


38 


RH .. 


3 


0 


48 


33 


TT ... 


19 


1 


28 


67 


VO.. 


1 


0 


00 


13 


YD.. 


4 


0 


60 


38 


RB 


2 


0 


30 


25 


TR. _ 


17 


1 


23 


64 


VT. . 


1 


0 


00 


13 


YR_._. 


4 


0 


60 


38 


RJ ... 


1 


0 


00 


13 


TF . 


7 


0 


85 


48 












YI_ . 


3 


0 


48 


33 


RK.. 


1 


0 


00 


13 


TN.. 


7 


0 


85 


48 


WE .. 


22 


1 


34 


69 


YP .. 


3 


0 


48 


33 












TC . 


6 


0 


78 


45 


WO. 


19 


1 


28 


67 


YB... 


2 


0 


30 


25 


ST... 


63 


1 


80 


88 


TD . 


6 


0 


78 


45 


WI .. 


13 


1 


11 


59 


YL .. 


2 


0 


30 


25 


SE ... 


49 


1 


69 


84 


TM . 


6 


0 


78 


45 


WA . 


12 


1 


08 


58 


YM_. 


2 


0 


30 


25 


SI.... 


34 


1 


53 


77 


TL . 


5 


0 


70 


42 


WH . 


4 


0 


60 


38 


YG_. 


1 


0 


00 


13 


SH... 


26 


1 


41 


72 


TU 


5 


0 


70 


42 


WN .. 


2 


0 


30 


25 


YH.__ 


1 


0 


00 


13 


SA .. 


24 


1 


38 


71 


TB._ 


3 


0 


48 


33 


WL_. 


1 


0 


00 


13 


YU... 


1 


0 


00 


13 


SS.... 


19 


1 


28 


67 


TP.. 


2 


0 


30 


25 


WR 


1 


0 


00 


13 


YW_. 


1 


0 


00 


13 


S0_.. 


15 


1 


18 


62 


TG . 


1 


0 


00 


13 


ws._ 


1 


0 


00 


13 












SC.... 


13 


1 


11 


59 


TQ... 


1 


0 


00 


13 


WY.. 


1 


0 


00 


13 


ZE__ 


2 


0 


30 


25 


SF... 


12 


1 


08 


58 


TZ .. 


1 


0 


00 


13 












ZA__„ 


1 


0 


00 


13 


SU .. 


11 


1 


04 


56 












XT... 


7 


0 


85 


48 


ZI._ 


1 


0 


00 


13 


SP.... 


10 


1 


00 


55 


UR... 


31 


1 


49 


75 


XA. . 


2 


0 


30 


25 


5.000 








SD . 


5 


0 


70 


42 


UN... 


21 


1 


32 


68 


xc... 


2 


0 


30 


25 













1 For arrangement alphabetically first under initial letters and then under final letters, see Table 6- A. 
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Tabld 9-A — The 428 different digraphs of Table 6-A, arranged first alphdbcticaUy according to 
their final letters and then according to their absolute frequencies, accompanied by the logarithms 
of their assigned probabilities 



F 


Lu(F) 


(ZF) 


r 


Lii(F) 


Lra* 

(2F) 


F 


Lio(F) 




F 


Lio(F) 


Liu 

(ZF) 


RA 




39 


1 


59 


80 


EC 




32 


B 




76 


RE.. 


98 


1 


99 


96 


GF. _ 


2 


0 


30 


25 


MA 




36 


1 


56 


78 


IC 




22 


Ej 




69 


TE._ 


71 


1 


85 


91 


PF 


2 


0 


30 


25 


EA. 


.. 


35 


1 


54 


78 


NC 




19 


El 




67 


NE... 


57 


1 


76 


87 


CF.. 


1 


0 


00 


13 


DA 


... 


32 


1 


51 


76 


AC 




14 


i 


15 


61 


VE_ 


57 


1 


76 


87 


MF... 


1 


0 


00 


13 


LA 


_ 


28 


1 


45 


74 


SC 


. 


13 


i 


11 


59 


SE 


49 


1 


69 


84 


UF... 


1 


0 


00 


13 


TA 


. 


28 


1 


45 


74 


RC 




9 


0 


95 


53 


EE 


42 


1 


62 


81 


XF . 


1 


0 


00 


13 


NA 




26 


1 


41 


72 


OC 




8 


0 


90 


51 


LE 


37 


1 


57 


79 












SA 




24 


1 


38 


71 


TC 




6 


0 


78 


45 


DE 


33 


1 


52 


77 












CA 




20 


1 


30 


67 


DC 


. 


4 


0 


60 


38 


CE 


32 


1 


51 


76 


MG . 


27 


1 


43 


73 


HA 




20 


1 


30 


67 


YC 




4 


0 


60 


38 


ME 


26 


1 


41 


72 


IG. 


19 


1 


28 


67 


PA. 




14 


1 


15 


61 


CC 




3 


0 


48 


33 


PE 


23 


1 


36 


70 


UG . 


8 


0 


90 


51 


WA 




12 


1 


08 


58 


HC 




3 


0 


48 


33 


WE 


22 


1 


34 


69 


RG . 


7 


0 


85 


48 


IA 




8 


0 


90 


51 


LC 




3 


0 


48 


33 


HE . 


20 


1 


m 


67 


AG 


6 


0 


78 


45 


GA. 




7 


0 


85 


48 


MC 




3 


0 


48 


33 


BE 


18 


1 


26 


66 


EG. 


4 


0 


60 


38 


OA 




7 


0 


85 


48 


uc 




3 


0 


48 


33 


GE . 


14 


1 


15 


61 


DG.. 


2 


0 


30 


25 


VA 




6 


0 


78 


45 


FC 




2 


0 


30 


25 


IE . 


13 


1 


11 


59 


OG . 


2 


0 


30 


25 


YA 




6 


0 


78 


45 


GC 




2 


0 


30 


26 


UE 


11 


1 




56 


SG ... 


2 


0 


30 


25 


FA 




5 


0 


70 


42 


XC 




2 


0 


30 


25 


FE . 


10 


1 




55 


FG . 


1 


0 


00 


13 


UA 




5 


0 


70 


42 


KC 




1 


0 


00 


13 


YE . 


9 


0 


95 


53 


GG . 


1 


0 


00 


13 


BA 




4 


0 


60 


38 


PC 


. 


1 


0 


00 


13 


KE 


6 


0 


78 


45 


LG. 


1 


0 


00 


13 


AA 




3 


0 


48 


33 














OE . 


3 


0 


48 


33 


TG 


1 


0 


00 


13 


XA 




2 


0 


30 


25 














JE ._ 


2 


0 


30 


25 


YG . 


1 


0 


00 


13 


JA 




1 


0 


00 


13 


ED. 


m 


60 


1 


78 


88 


ZE _ 


2 


0 


30 


25 












KA 




1 


0 


00 


13 


ND 




52 


1 


72 


85 


AE 


1 






13 












ZA 




1 


0 


00 


13 


AD 




27 


1 


43 


73 


XE. 


1 






13 
























RD. 




17 


1 


23 


64 












TH . 


78 


1 


89 


92 


AB 




6 


0 


78 


45 


OD 


. 


12 


1 


08 


58 












SH. 


26 


1 


41 


72 


MB 




6 


0 


78 


45 


LD 




9 


0 


95 


53 












GH . 


20 


1 


30 


67 


DB 




4 


0 


60 


38 


DD 


_ 


8 


0 


90 


51 


OF . 


25 


1 


m 


72 


CH. 


14 


1 


15 


61 


EB 




4 


0 


60 


38 


ID. 


... 


6 


0 


78 


45 


EF . 


18 


1 


26 


66 


EH . 


7 


0 


85 


48 


OB. 




4 


0 


60 


38 


TD 




6 


0 


78 


45 


SF . 


12 


1 




58 


NH 


4 


0 


60 


38 


LB. 




3 


0 


48 


33 


SD 




5 


0 


70 


42 


FF . 


11 


1 




66 


WH 


4 


0 


60 


38 


SB 




3 


0 


48 


33 


YD 




4 


0 


60 


38 


YF 


11 


1 


04 


56 


OH . 


3 


0 


48 


33 


TB 




3 


0 


48 


33 


UD 




3 


0 


48 


33 


IF 


10 


1 


[ml 


55 


PH . 


3 


0 


48 


33 


UB 




3 


0 


48 


33 


HD 




2 


0 


30 


25 


NF 


9 


0 


95 


53 


RH . 


3 


0 


4& 


33 


IB 




2 


0 


30 


25 


CD 




1 


0 


00 


13 


DF 


8 


0 


90 


51 


AH . 


2 


0 


30 


25 


NB 




2 


0 


30 


25 


FD 




1 


0 


00 


13 


TF - 


7 


0 


85 


48 


DH 


2 


0 


30 


25 


RB 




2 


0 


30 


25 


GD 




1 


0 


00 


13 


RF- 


6 


0 


78 


45 


LH 


1 


0 


00 


13 


YB 




2 


0 


30 


25 


MD 




1 


0 


00 


13 


HF 


5 






42 


MH 


1 


0 


00 


13 


HB 




m 


0 


00 


13 


PD 




1 


0 


00 


13 


AF 


4 






38 


XH 


1 


0 


00 


13 


PB 


- 


1 


0 


00 


13 


XD 


- 


1 


0 


00 


13 


LF 


3 






33 


YH 


1 


0 


00 


13 
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Table 9-A, Contd — The i28 different digraphs of Table 6-A, arranged first alphabetically accord- 
ing to their final letters and then according to their absolute frequencies, accompanied by the 
logarithms of their assigned probabilities 
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Table 9-A, Concluded — The 428 different digraphs of Table 6-A, arranged first alphdbettciMy 
according to their final letters and then according to their absolute frequencies, accompanied by 
the logarithms of their assigned probabilities 



* 


LaCT) 


■m 


r 


Lu(F) 


L*a 

(2F) 


JP 


Lio(T) 


li*3< 

(2F) 


r 


Lu(F> 


Lm 

<2F) 


ES 


54 


n 




86 


OT. 


19 


1 


28 




JU„ 


2 


0 


30 


25 


PW. 


1 


0 


00 


13 


AS. . 


41 


H 




80 


TT 


19 


1 


28 


67 


LU_. 


2 


0 


30 


25 


YW-... 


1 


0 


00 


13 


I S_„ 


85 


B 




78 


DT_. 


15 


1 


18 


62 


MU . 


2 


0 


30 


25 












RS 


31 


i 


49 


75 


YT. _ 


15 


1 


18 


62 


YU ._ 


1 


0 


00 


13 


TY 


IK 


1 


1 Q 


go 


NS .. 
SS . 


24 

19 


i 

i 


38 

28 


71 

67 


CT_. 

UT 


14 

12 


1 

1 


15 

08 


61 

58 


IV.. 


25 


1 


40 


72 


-LA — 

EX. .. 

nv 


10 

7 

1 


0 

A 


lo 

85 

nn 


48 

1 Q 


TS. 


19 


i 


28 


67 


FT. 


11 


1 


04 


56 


EV . 


20 


1 


30 


67 


UA 


1 


u 




ID 


OS_. 


14 


i 


16 


61 


LT 


8 


0 


90 


51 


AV.. 


7 


0 


85 


48 












DS 


13 


i 


11 


59 


PT 


8 


0 


90 


51 


OV . 


7 


0 


85 


48 


TY ._ 


41 


1 


61 


80 


US_.. 


12 


i 


08 


58 


XT 


7 


0 


85 


48 


RV 


5 


0 


70 


42 


AY... 


12 


1 


08 


58 


YS. _ 


11 


i 


04 


56 


GT 


4 


0 


60 


38 


DV. 


3 


0 


48 


33 


LY... 


10 


1 


00 


55 


LS_. 


6 


0 


78 


45 


MT 


2 


0 


30 


25 


NV . 


3 


0 


48 


33 


RY... 


9 


0 


95 


53 


PS. . 


6 


0 


78 


46 


BT 


1 


0 


00 


13 


LV 


2 


0 


30 


25 


BY .. 


7 


0 


85 


48 


HS 


4 


0 


60 


38 


VT. 


1 


0 


00 


13 


PV . 


1 


0 


00 


13 


NY 


5 


0 


70 


42 


MS _ 


4 


0 


60 


38 












SV... 


1 


0 


00 


13 


EY.. 


4 


0 


60 


38 


FS... 


3 


0 


48 


33 


OU. 


37 


1 


67 


79 


UV . 


1 


0 


00 


13 


MY 


2 


0 


30 


25 


GS. . 


3 


0 


48 


33 


QU 


15 


1 


18 


62 












OY ... 


2 


0 


30 


25 


BS .. 


1 


0 


00 


13 


AU 


13 


1 


11 


59 


TW.. 


36 


1 


56 


78 


CY 


1 


0 


00 


13 


CS _ 


1 


0 


00 


13 


SU. 


11 


1 


04 


- 56 


OW.. 


8 


0 


90 


51 


DY.._ 


1 


0 


00 


13 


KS 


1 


0 


00 


13 


HU 


8 


0 


90 


51 


EW_. 


7 


0 


85 


48 


FY 


1 


0 


00 


13 


WS .. 


1 


0 


00 


13 


NU . 


7 


0 


85 


48 


DW . 


4 


0 


60 


38 


HY... 


1 


0 


00 


13 


XS_. 


1 


0 


00 


13 


DU 


5 


0 


70 


42 


RW 


4 


0 


60 


38 


PY 


1 


0 


00 


13 












RU 


5 


0 


70 


42 


SW ... 


4 


0 


60 


38 


SY.. 


1 


0 


00 


13 


NT... 


82 


1 


91 


93 


TU . 


5 


0 


70 


42 


AW . 


3 


0 


48 


33 


WY 


1 


0 


00 


13 


ST 


63 


1 


80 


88 


CU. 


4 


0 


60 


38 


NW 


3 


0 


48 


33 












AT_ 


47 


1 


67 


83 


EU . 


3 


0 


48 


33 


LW_ . 


2 


0 


30 


25 


T *7 


Q 


A 


QA 


OK 


RT 


42 


1 


62 


81 


FU . 


3 


0 


48 


33 


CW ... 


1 


0 


00 


13 


J.4i .. 
t?*7 


a 


u 

A 


oU 

AA 


60 
1 o 


ET 


37 


1 


57 


79 


PU 


3 


0 


48 


33 


FW. . 


1 


0 


00 


13 


Ct/j m „ 


1 

1 


u 

A 


uu 

AA 


• lo 
1 o 


HT . 


28 


1 


45 


74 


BU.. 


2 


0 


30 


25 


GW... 


1 


0 


00 


13 


X4J _ . 


1 


u 


UU 


lo 


IT .. 


27 


1 


43 


73 


GU... 


2 


0 


30 


25 


HW . 


1 


0 


00 


13 


5, 


ooo 
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Table 9-B — The 18 digraphs composing 25% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 



(1) AND ACCORDING TO THEIR INITIAL (2) AND ACCORDING TO THEIR ABSOLUTE 
LETTERS FREQUENCIES 



F 


L»(F) 


lau 

( 2 r) 


F 


Lu(F) 


WDjW 


F 


L»(F) 




F 


Li»(F) 


XT 

(2F) 


ED... 60 


1 78 


88 


IN.. 76 


1 88 


92 


ED„ 60 


1 78 


88 


IN . 75 


1 88 


92 


ND. 52 


1 72 


85 


ON.. 77 


1 89 


92 


ND... 52 


1 72 


85 


AN— 64 


1 81 


89 


NE. 67 


1 76 


87 


TO.. 50 


1 70 


84 


RE. 98 


1 99 


96 


TO .. 50 


1 70 


84 


RE... 98 


1 99 


96 








TE. .. 71 


1 85 


91 








SE 49 


1 69 


84 


ER-. 87 


1 94 


94 


NE... 57 


1 76 


87 


ER... 87 


1 94 


94 


TE . 71 


1 85 


91 


OR ._ 64 


1 81 


89 


VE . 57 


1.76 


87 


OR. 64 


1 81 


89 


VE_ . 57 


1 76 


87 








SE.. 49 


1 69 


.84 














ES 54 


1 73 


86 








ES . 64 


1 73 


86 


TH... 78 


1 89 


92 


NT. . 82 


1 91 


93 


TH . 78 


1 89 


92 


NT— 82 


1 91 


93 








ST... 63 


1 80 


88 








ST.... 63 


1 80 


88 


AN.... 64 


1 81 


89 


1,249 






EN. Ill 


2 05 


99 


1,249 






EN ...111 


2 05 


99 








ON. 77 


1 89 


92 









Table 9-C — The 53 digraphs composing 50% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 



(1) AND ACCORDING TO THEIR INITIAL LETTERS 
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Table 9-C, Concluded — The 53 digraphs composing 50% of the 5,000 digraphs of Table 6-A, 
accompanied by the logarithms of their assigned probabilities, arranged alphabetically according io 
their final letters 



(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 




Table 9-D — The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, accompanied by 
the logarithms of their assigned probabilities, arranged alphabetically according to their final letters 




2-26 



R ESTRICTED 





























REF ID:A56892 



RESTRICTED 



Table 9-D, Could —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, ac- 
companied by the logarithms of their assigned probabilities, arranged alphabetically according to 
their final letters 



(1) AND ACCORDING TO THEIR INITIAL LETTERS— Concluded 




(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES 



Ln 

CZT) 



Li»(F) 



Lhi 

J2F) 



Lio(r) 



F | L„(F) I 



F 



L»(r> 



T3T" 

gF) 



RA. 

MA 

EA.. 

DA 

LA. 

TA. 

NA. 

SA. 

CA. 

HA. 

PA. 

EC. 
IC 
NC 
AC. 

ED. 
ND 
AD 
RD 



39 

36 

35 

32 

28 

28 

26 

24 

20 

20 

14 

32 

22 

19 

14 

60 

52 

27 

17 



59 

56 

54 

51 

45 

45 

41 

38 

30 

30 

15 



1 51 
1 34 
1 28 
1 15 

,1 78 
1 72 
1 43 
1 23 



80 

78 

78 

76 

74 

74 

72 
71 
67 
67 
61 

76 

69 

67 

61 

88 

85 

73 
64 



RE. 

TE 

NE 

VE 

SE. 

EE 

LE. 

DE. 

CE. 

ME 

PE 

WE. 

HE 

BE 

GE 

IE.. 

OF 

EF. 

NG. 

IG 



98 

71 

57 

67 

49 

42 

37 

33 

32 

26 

23 

22 

20 

18 

14 

13 

25 

18 

27 

19 



99 

85 

76 

76 

69 

62 

57 

52 

51 

41 

36 

34 

30 

26 

15 

11 



1 40 
1 26 

1 43 
1 28 



96 

91 

87 

87 

84 

81 

79 

77 

76 

72 

70 

69 

67 

66 

61 

59 

72 
66 

73 
67 



TH. 

SH 

GH. 

CH 

TI 

FI. 

SI . 

HI. 

NI 

RI 

DI 

El. 

LI 

AI.. 

AL 

EL 

LL. 

IL. 

OL. 



78 

26 

20 

14 

46 

39 

34 

33 

30 

30 

27 

27 

20 

17 

32 

29 

27 

23 

19 



1 89 
1 41 
1 30 
1 15 



65 

59 

53 

52 

48 

48 

43 

43 

30 

23 



1 51 
1 46 
1 43 
1 36 
1 28 



92 

72 
67 
61 

82 

80 

77 

77 

75 

75 

73 

73 
67 
64 

76 

74 
73 
70 
67 



OM. 
AM 

EM. 

EN. . 

ON. 
IN 
AN 
UN. 

TO- 
CO- 
10. 
FO. 
RO . 
HO. 
WO. 
NO. 
PO 
DO 
SO. 
LO. 



. 25 
. 14 
. 14 

111 
. 77 
. 75 
. 64 
. 21 

. 50 
. 41 
. 41 
. 40 
. 28 
. 20 
. 19 
18 
17 
. 16 
. 15 
. 13 



1 40 
1 15 

1 15 

2 05 
1 89 
1 88 
1 81 
1 32 



70 

61 

61 

60 

45 

30 

28 

26 

23 

20 

18 

11 



72 

61 

61 

99 

92 

92 

89 

68 

84 

80 

80 

80 

74 

67 

67 

66 

64 

63 

62 

59 
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Table 9-D, Concluded —The 122 digraphs composing 75% of the 5,000 digraphs of Table 6-A, 
accompanied by the logarithms of their assigned probabilities, arranged alphabetically according to 
their final letters 

(2) AND ACCORDING TO THEIR ABSOLUTE FREQUENCIES— Concluded 



* 


L„(F) 


Lu« 

(an 


F 


Lio(F) 




F 


Lm(F) 


(2F) 


F 


L»(F) 


Liu 

(2F) 


OP.... 26 


1 40 


72 


ES.. 64 


IBS 


86 


NT.. 82 


1 91 


93 


OU. 37 


EC 


79 


EP... 20 


1 80 


67 


AS.. 41 




80 


ST 63 


1 80 


88 


QU . 15 


mm 


62 








IS. 35 




78 


AT . 47 


1 67 


83 


AU_. 13 


Ed 


59 








RS_ 31 


1 49 


75 


RT. 42 


1 62 


81 


tw ntc 


1 A A 


70 


ER . 87 


1 94 


94 


NS . 24 


1 38 


71 


ET 37 


1 57 


79 


XV._ - 40 


JL 4U 

1 OA 


l 4 
C7 


OR... 64 


1 81 


89 


SS .. 19 


1 28 


67 


HT . 28 


1 45 


74 


Shy — 4\) 


1 oU 


D l 


AR. 44 


1 64 


82 


TS. 19 


1 28 


67 


IT 27 


1 43 


73 


Tff 36 


1 56 


.78 


UR... 31 


1 49 


75 


OS 14 


1 15 


61, 


OT . 19 


1 28 


67 


TV 1 K 


1 13 


ao 


1R.. 27 


1 43 


73 


DS.. 13 


1 11 


59 


TT . 19 


1 28 


67 


— lo 


1 lo 


\)4 


PR.. 18 


1 26 


66 








DT . 15 


1 18 


62 


TY. 41 


1 61 


80 


HR ._ 17 


1 23 


64 








YT . 15 


1 18 


62 


3,746 






TR 17 


1 23 


64 








CT . 14 


1 15 


61 









Table 9-E — All the 428 different digraphs of Table 6-A, arranged alphabetically first according to 
their final letters and then according to their initial letters 
(SEE TABLE 6-A —READ DOWN THE COLUMNS) 



Table 10-A — The 56 tngraphs appearing 100 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged according to their absolute frequencies, accompanied by the loga- 
rithms of their assigned probabilities 



F 


L»(F) 


(2F) 


F 


Lu(F> 


IPS* 


P 1 


Lu(D 


~X — 

(2P) 


ENT 


569 


2 


76 


99 


TOP 


. 174 


2 


24 


82 


EIG 


135 


2 


13 


79 


ION 


260 


2 


41 


88 


NTH . . . 


.. 171 


2 


23 


82 


FIV .. _ . 


135 


2 


13 


79 


AND 


228 


2 


36 


86 


TWE 


_. 170 


2 


23 


82 


MEN. ... 


131 


2 


12 


78 


ING 


226 


2 


35 


86 


TWO _. 


. 163 


2 


21 


81 


SEV 


131 


2 


12 


78 


IVE 


225 


2 


35 


86 


ATI .. . 


. 160 


2 


20 


81 


ERS.. .. 


126 


2 


10 


78 


TIO 


221 


2 


34 


85 


THR 


... 158 


2 


20 


81 


UND.. . . 


125 


2 


JO 


78 


FOR 


218 


2 


84 


85 


NTY . .. 


.. 157 


2 


20 


81 


NET . . . 


118 


2 


07 


77 


OUR 


211 


2 


32 


85 


HRE ... 


. 153 


2 


18 


80 


PER ... . 


115 


2 


06 


76 


THI... 


211 


2 


32 


85 


WEN . ._ 


... 153 


2 


18 


80 


STA 


115 


2 


06 


76 


ONE 


210 


2 


32 


85 


FOU ... 


_. 152 


2 


18 


80 


TER . .. 


DO 


2 


06 


76 


NIN 


207 


2 


32 


85 


ORT. .. 


_ 146 


2 


16 


80 


EQU . .. 


lil 


2 


06 


76 


STO .... 


202 


2 


31 


84 


REE 


146 


2 


16 


80 


RED.. . 


113 


2 


05 


76 


EEN 


196 


2 


29 


84 


SIX 


. 146 


2 


16 


80 


TED . . 


112 


2 


05 


76 


GHT_ 


196 


2 


29 


84 


ASH . 


. 143 


2 


16 


80 


ERI 


109 


2 


04 


76 


INE . 


192 


2 


28 


83 


DAS _... 


... 140 


2 


16 


79 


HIR 


106 


2 


03 


75 


VEN . . 


190 


2 


28 


83 


IGH . 


140 


2 


15 


79 


IRT .. 


105 


2 


02 


75 


EVE .. _ 


177 


2 


25 


82 


ERE . ._ 


. 138 


2 


14 


79 


DER 


101 


2 


00 


74 


EST. ._. 


176 


2 


26 


82 


COM. . 


136 


2 


13 


79 


DRE .. . 


100 


2 


00 


74 


TEE 


174 


2 


24 


82 


ATE .. 


. 135 


2 


13 


79 
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Table 10-B — The 56 trigraphs appearing 100 or more times in the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their initial letters, and then 
according to their absolute frequencies, accompanied by the logarithms of their assigned pi obabilittes 




Table 10-C — The 56 tngraphs appearing 100 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their central letters, and then ac- 
cording to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 
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Table 10-C, Concluded — The 56 tngraphs appearing 100 or more times in the 50,000 letters of 
Governmental plain-text telegrams, arranged first alphabetically according to their central letters, 
and then according to their absolute frequencies, accompanied by the logarithms of their assigned 
probabilities 



F 


L»(rj 


tnsi 

(2D 


r 


LitCT) 


■IB 


p 


Lm(F) 


Lm 

(ZD 


EQU 


114 


2 06 


76 


EST 


. 176 


2 25 


82 


OUR . .. 


211 


2 32 


85 










ASH 


143 


2 16 


80 










HRE 


153 


2 18 


80 


















ORT- 


146 


2 16 


80 


STO 


. 202 


2 31 


84 


IVE 


225 


2 35 


86 


ERE 


188 


2 14 


79 


NTH 


. 171 


2 23 


82 


EVE .. . 


177 


2 25 


82 


ERS 


126 


2 10 


78 


ATI 


160 


2 20 


81 










ERI 


109 


2 04 


76 


NTY 


157 


2 20 


81 










IRT .... 


105 


2 02 


75 


ATE 


.. 135 


2 13 


79 


TWE_. 


170 


2 23 


82 


DRE . ... 


100 


2 00 


74 


STA 


115 


2 06 


76 


TWO .. . 


163 


2 21 


81 



Table 10-D — The 56 tngraphs appeanng 100 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their final letters, and then according 
to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 
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Table 11-A — The 5i tetragrapks appearing 50 or more times m the 50,000 letters of Governmental 
plan-text telegrams, arranged according to their absolute frequencies, accompanied by the logarithms 



ElU 



Table 11-B — The 54 
plain-text telegrams , 
cording to their abso 



Lu(F) 


I mi 
<2F) 


F 


Lu(F) 


<2T> 


2 34 


99 


THIR „ . 104 


2 02 


87 


2 23 


95 


EENT . . 102 


2 01 


87 


2 21 


94 


REQU . 98 


1 99 


86 


2 21 


94 


HIRT .. 97 


1 99 


86 


2 19 


93 


COMM . 93 


1 97 


85 


2 18 


93 


QUES .. 87 


1 94 


84 


2 18 


93 


UEST . . . 87 


1 94 


84 


2 18 


93 


EQUE . 86 


1 93 


84 


2 17 


93 


NDRE . .. 77 


1 89 


82 


2 16 


92 


OMMA. .. 71 


1 86 


81 


2 15 


92 


LLAR . . 71 


1 85 


81 


2 13 


91 


OLLA .. . 70 


1 85 


81 


2 13 


91 


VENT .... 70 


1 85 


81 


2 12 


91 


DOLL . . 68 


1 83 


80 


2 12 


91 


LARS . . 68 


1 83 


80 


2 08 


89 


THIS 68 


1 83 


80 


2 06 


89 


PERI .. . 67 


1 83 


80 


2 05 


88 


ERIO. 66 


1 82 


80 



ASHT. 

HUND 

DRED. 

RIOD. 

IVED 

ENTS 

FFIC 

FROM. 

IRTY. 

RTEE. 

UNDR. 

NAUG. 

OURT. 

UGHT . 

STAT. 

AUGH. 

CENT 

FICE 



54 tetragrapks appearing 50 or more times in the 50,000 le 
ams, arranged first alphabetically according to their initial 
absolute frequencies, accompanied by the logarithms of their < 



h«OT) 



.. 64 

64 
63 
63 

. 62 
. 62 
. 62 1 79 
... 59 1 77 
. 69 1 77 

. 59 1 77 

... 69 1 77 
... 66 1 75 

. . 56 1 75 
56 1 75 
. .. 54 1 73 
.. 52 1 72 
... 52 1 72 
. 50 1 70 



etters of Governmental 
\ letters, and, then ale- 
assigned 'probabilities 




ASHT j 64 1 81 

AUGH_ .. 52 1 72 



DASH. 

DOLL. 

DREb. 

EVEN- 

ENTY. 

EIGH. 

ENTH 

EENT 

EQUE_ 

ERIO. 

ENTS- 

FOUR. 

FIVE.. 

FFIC- 

FROM. 

FICE. 



93 1 97 
52 1 72 

132 2 12 
68 1 83 
63 1 80 

168 2 23 
161 2 21 
132 2 12 
114 2 06 
102 2 01 
86 1 93 
66 1 82 
62 1 79 

144 2 16 
185 2 13 
62 1 79 
59 1 77 
50 1 70 



F 


Lio(F) 


Lni 

(2F) 


F 


Lio(F) 


HREE .. 134 


2 13 


91 


REQU. . .. 98 


1 99 


HIRT. .. 97 


1 99 


86 


RIOD 63 


1 80 


HUND .... 64 


1 81 


79 


RTEE 59 


1 77 


IGHT. . 140 


2 15 


92 


STOP .. ._ 154 


2 19 


IVED ._ 62 


1 79 


78 


SEVE 121 


2 08 


IRTY. 69 


1 77 


78 


STAT. . . 64 


1 73 


LLAR ._ 71 


1 85 


81 


TION ..... 218 


2 34 


LARS . 68 


1 83 


.80 


TEEN.. _ 163 


2 21 


MENT 111 


2 05 


88 


TWEN 152 


2 18 








THRE 149 


2 17 


NINE . 153 


2 18 


93 


THIR 104 


2 02 


NDRE .... 77 


1 89 


82 


THIS . . 68 


1 83 


NAUG. 66 


1 75 


77 






OMMA 71 


1 85 


81 


UEST 87 

UNDR ... 59 


1 94 
1 77 


OLLA.. .. 70 


1 85 


81 


UGHT ... 56 


1 75 


OURT 56 


1 75 


77 






PERI 67 


1 83 


80 


VENT . . 70 


1 85 


QUES... . 87 


1 94 


84 


WENT 153 


2 18 
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Table 11-C — The 5k tetragraphs appearing 50 or more times in the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their second letters, and then ac- 
cording to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 



DASH 

LARS 

NAUG. ... 

NDRE 

TEEN 

WENT 

SEVE .... 

MENT 

EENT 

REQU 

UEST 

VENT 

PERI 

CENT 

FFIC 

IGHT 

UGHT 

THRE 

THIR .. . 
THIS .. . . 



p 


Li«(F) 


Lm 

(2F) 


p 


L»(P) 


132 


2 


12 


91 


TION. . 


218 


2 


34 


68 


1 


83 


80 


NINE 


153 


2 


18 


66 


1 


76 


77 


FIVE . . 


135 


2 


18 










EIGH . .. 


132 


2 


12 


77 


1 


89 


Kjfli 


HIRT 


97 


1 


99 


163 


2 


21 




RIOD . . 


63 


1 


80 


163 


2 


18 


93 


FICE . 


60 


1 


70 


121 


2 


08 


89 










111 


2 


05 


88 


LLAR 


71 


1 


85 


102 


2 


01 


87 


OLLA . 


70 


1 


85 


98 


1 


99 


86 










87 


1 


94 


84 


OMMA 


71 


1 


85 


70 


1 


85 


81 










67 


1 


83 


80 


ENTY . 


161 


2 


21 


62 


1 


72 


76 


ENTH. 


114 


2 


06 










ENTS. . 


62 


1 


79 


62 


1 


79 


78 


UNDR. . 


59 


1 


77 


140 


2 


16 


92 










56 


1 


75 


77 


FOUR 


144 


2 


16 










COMM . . 


93 


1 


97 


149 


2 


17 


93 


DOLL . .. . 


68 


1 


83 


104 


2 


02 


87 










68 


1 


83 


80 


EQUE 


86 


1 


93 



1BMJ 

mimm 


p 


Lu(F) | 


Lm 

(2D 


99 


HREE 134 


2 13 


91 


93 


ERIO 66 


1 82 


80 


91 | 


DRED ... 63 


1 80 


79 


91 


FROM 59 


1 77 


78 


86 

79 


IRTY ... 59 


1 77 


78 


75 


ASHT 64 


1.81 


79 


81 


STOP . 154 


2 19 


93 


81 


RTEE . . 59 


1 77 


78 


81 


STAT 54 


1 73 


76 


94 

89 

78 

78 


QUES. ... 87 


1 94 


84 


HUND 64 


1 81 


79 


OURT 56 


1 75 


77 


AUGH . . 52 


1 72 


76 


92 


EVEN .. 168 


2 23 


95 


85 

80 


IVED ... 62 


1 79 


78 


84 


TWEN 152 


2 18 


93 



Tablb 11-D — The 5k tetragraphs appearing 50 or more times m the 50,000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their third letters, and then according 
to their absolute frequencies, accompanied by the logarithms of their assigned probabilities 



p 


Lia(F) 


1*4 

(2F) 


p 


Lm(F) 


Lll4 

<2F) 


F 


Lio(F) 


LtM 

(2D 


LLAR .. 71 

STAT 54 

FICE 60 

UNDR . .. 69 

EVEN . 168 

TEEN 163 

TWEN.. .. 152 
HREE ... 134 

QUES 87 

DRED 63 

IVED. . 62 
RTEE.. .. 59 


1 85 
1 73 

1 70 

1 77 

2 23 
2 21 
2 18 
2 13 
1 94 
1 80 
1 79 
1 77 


81 

76 

.76 

78 

95 

94 

93 

91 

84 

79 
78 
78 


EIGH 132 

AUGH. .. 52 

IGHT 140 

ASHT . ... 64 

UGHT.. ... 56 

THIR .... 104 
THIS . ... 68 

ERIO ... 66 

FFIC... . 62 

OLLA ..._ 70 

DOLL. ... 68 


2 12 

1 72 

2 15 
1 81 

1 75 

2 02 
1 83 
1 82 
1 79 

1 85 
1 83 


91 

76 

92 

79 

77 

87 

80 
80 

78 

81 

80 


1 


1 97 

1 85 

2 18 
2 18 
2 05 
2 01 
1 85 
1 81 

1 72 

2 34 
2 19 
1 80 
1 77 


85 

81 

93 

93 

88 

87 

81 

79 

76 

99 

93 

79 

78 
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Table 11-D, Concluded — The Bit tetragraphs appearing BO or more times m the 50,000 letters of 
Governmental plain-text telegrams, arranged first alphabetically according to their third letters, and 
then according to their absolute frequencies, accompanied by the logarithms of their assigned 
probabilities 



F 


L m (T) 


mmmm 


p 


Lu(F) 


Lrt 

( 2 r> 


r 


Lio(F) 


Lrn 

( 2 r> 


REQU. ... 


98 


1 99 


86 


DASH . . . 


132 


2 12> 


91 


FOUR . . 


144 


2 16 


92 










UEST. . . 


87 


1 94 


84 


EQUE 


86 


1.93 


84 


THRE . 


149 


2 17 


93 










NAUG 


56 


1 76 


77 


HIRT 


97 


1 99 


86 


















NDRE .. 


77 


1 89 


82 


ENTY. 


161 


2 21 


94 










LARS. .. 


68 


1 83 


80 


ENTH .. . 


114 


2 06 


89 










PERI . . . 


67 


1 83 


80 


ENTS .. 


62 


1 79 


78 


FIVE ... 


135 


2 13 


91 


OURT 


56 


1 75 


77 


IRTY. .. 


59 


1 77 


78 


SEVE.. „ 


121 


2 08 


89 



Table 11-E — The Bit tetragraphs appearing SO or more times m the BO, 000 letters of Governmental 
plain-text telegrams, arranged first alphabetically according to their final letters, and then according 
to their absolute frequencies, accompanied by the logarithms of tlmr assigned probabilities 



OMMA .. 
OLLA . . 

FFIC .... 



Lutn 



1*224 

(2F) 



71 1 85 
70 1 85 



81 

81 



62 



1 79 



78 



DASH 
EIGH . 
ENTH 
AUGH . 



132 

132 

114 

52 



Lu(D 

2 12 
2 12 
2 06 
1 72 



Lau 

(2F) 

91 

91 

89 

76 



QUES 

THIS 

LARS 

ENTS 



HUND .. . 
DRED... 

RIOD 

IVED. 



64 

63 

63 

62 



1 81 
1 80 
1 80 
1 79 



NINE. . 

THRE 

FIVE 

HREE ... 
SEVE . 
EQUE ... 
NDRE 
RTEE ._ 
FICE .. 



153 

149 

135 

134 

121 

86 

77 

59 

50 



2 18 
2' 17 
2 13 



13 

08 

93 

89 

77 

70 



NAUG ... 



56 



1 75 



79 

79 

79 

78 



PERI 

DOLL 

COMM . 
FROM . 



93 

93 

91 

91 

89 

84 

82 

78 

75 



77 



TION . . 
EVEN. . 
TEEN . 
TWEN 

ERIO 

STOP ... 

FOUR . . 
THIR . . 
LLAR . _ 

UNDR ._ 



67 

68 

93 

59 

218 

168 

163 

152 

66 

154 



83 

83 

97 

77 

34 

23 

21 

18 

82 

19 



80 

80 

85 

78 

99 

95 

94 

93 

80 

93 



WENT 

IGHT . 

MENT 

EENT . .. 

HIRT 

UEST ... 
VENT. . . 

ASHT 

OURT 

UGHT 

STAT 

CENT 



144 

104 

71 

59 



16 

02 

85 

77 



92 

87 

81 

78 



REQU. . 

ENTY 

IRTY .. 



F 


Lio(F) 


Lra 

(2D 


87 


1 94 


84 


68 


1 83 


80 


68 


1 83 


80 


62 


1 79 


78 


153 


2 18 


93 


140 


2 15 


92 


111 


2 05 


88 


102 


2 01 


87 


97 


1 99 


86 


87 


1 94 


84 


70 


1 85 


81 


64 


1 81 


79 


56 


1 75 


77 


66 


1 75 


77 


54 


1 73 


76 


52 


1 72 


76 


98 


1 99 


86 


161 


2 21 


94 


59 


1 77 


78 
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Table 12 — Average length of words and messages 



Number of 
letters m 
word 

X 


Number of 
times s-letter 
word appears 


Number of 
letters 


1 


378 


378 


2 


973 


1,946 


3 


1,307 


3,921 


4 


1,635 


6,540 


5 


1,410 


7,050 


6 


1,143 


6,858 


7 


1,009 


7,063 


8 


717 


5,736 


9 


476 


4,284 


10 


274 


2,740 


11 


161 


1,771 


12 


86 


1,032 


13 


23 


299 


14 


23 


322 


15 


4 


60 




9,619 


50,000 



(1) Average length of words 6 2 letters 

(2) Average length of messages.. 217 letters 

(3) Modal (most frequent) length 105-114 letters 



(4) It is extremely unusual to find five consecutive letters without at least one vowel 
(6) The average number of letters between vowels is two 
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A 



Tabld 13 — Checkerboard individual frequencies 1 

(Based on a count ol 6 000 digraphs] 



Pi Cl 



A 


B 


C 


D 


E 


244 


225 


375 


394 


197 


F 


G 


H 


I J 


K 


125 


98 


193 


271 


95 


L 


M 


N 


0 


P 


229 


199 


188 


350 


251 


Q 


R 


S 


T 


U 


148 


162 


258 


427 


295 


V 


ff 


X 


Y 


Z 


42 


12 


34 


91 


97 


212 


317 


358 


308 


249 


A 


B 


C 


D 


E 


120 


108 


216 


256 


85 


F 


G 


H 


1 J 


K 


216 


140 


152 


435 


269 


L 


M 


N 


0 


P 


206 


121 


306 


364 


284 


Q 


R 


S 


T 


U 


38 


29 


21 


147 


43 


V 


W 


X 


Y 


Z 



C 8 P, 



1 The numbers in the C l Cj squares represent the frequency of the individual components of the cipher digraph 
used to replace a given Pj Pi digraph in accordance with a digraphic checkei board system where Pi and Pi are the 
plam-text squares 
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Table 14 — Relative logarithmic values of frequencies of English digraphs 

[Based on a count o{ 5,000 digraphs. To obtain logarithm to baso 10 (Log 10) divide by 100] 

Second Letter 

ABODE FGHIJKLMN 0 P QRSTUVWXYZ 



A 


48 


1 


116 


143 


00 


e 


78 


E 


123 


m 




161 


115 


181 


E 




He 


164 


161 


167 


ill 


85 


48 


He 




He 


B 


60 


* 


A 


* 


126 


He 


* 


* 




1 


H* 


78 


E 


He 


E 


He 


He 


30 






E 


He 


He 


He 


85 


He 


C 


B 


* 


48 


E 


151 


E 


* 


115 


85 


* 


m 






E 




* 


A 


60 




115 


H 


* 




* 


E 


* 


D 


161 


60 


EE 




162 


E 


E 


E 


143 


I 


* 


48 








E 


E 


H 


111 


118 


E2 


48 




* 




* 


E 


164 


60 


161 


178 


162 


126 


E 


85 


143 


3! 


* 


146 


116 


206 




B 


108 


194 


173 


167 


48 


130 


86 


85 






F 


70 


* 


E 


E 


E 




E 


* 


159 


* 


* 


E 


E 


He 




E 


He 


95 


48 




48 


He 


B 


He 




He 


G 


85 


* 






115 


E 




E 




00 


* 




E 


48 


78 


E 


He 


E 


48 


E 


B 


He 


E 


He 


He 


He 


H 




00 


48 


E 


E 


M 


* 


* 


152 


* 


* 




E 


48 


E 


E 




123 




145 


B 


He 




He 




He 


I 


E 


HE 


135 


78 


111 


B 


128 


* 


* 


* 


30 


136 


95 


188 


161 


85 


He 


143 


164 


143 


He 


m 


H« 


118 


He 


1 


J 


00 


* 


* 


A 




* 


* 


* 


* 


* 


* 


He 


H« 


H« 




He 


He 


He 


H« 


He 


30 


He 


A 


H* 


He 


He 


K 




* 


B 


* 


78 


* 


* 


V 


E 


* 


* 




He 


E 


He 


He 


He 


He 


E 


* 


He 


He 


He 


He 


H« 


He 


L 


145 


48 


48 


95 


167 


48 


E 


E 




* 


* 


143 


E 


E 


in 


48 


He 




78 


90 




E 




* 




He 


M 


166 


78 


48 


EE 


141 


E 


* 


E 


95 


* 


* 


He 


in 


He 


H 


E 


* 




E 






* 


He 


He 




He 


N 


141 


J 


128 


172 


176 


95 


43 


E 


148 


00 


30 


70 




E 


126 


48 






138 


191 


85 


48 


48 


* 


E 


He 


0 


85 




E 


B 


48 


140 


E 


48 


70 


00 


30 


128 


B 


189 


78 


B 


He 


181 


115 


128 


157 


85 


90 


mm 

■■i 


E 


He 


P 


115 




E 


E 


136 




* 


48 


78 


* 


* 


E 


E 


E 


123 




He 


126 


78 


E 


48 


E 




He 


B 


He 


Q 


* 


* 


* 


* 


* 


* 


* 


* 


* 


H« 


He 




E 


* 


* 


He 


H« 




He 


He 


118 


He 


He 


H« 


He 


He 


R 


159 


a 


95 


123 


199 


78 


85 


48 


148 


00 


00 




95 


85 


145 


ill 


He 




149 


162 


E 


Eg 


60 


He 


95 


He 


S 


138 


48 


m 


70 


169 


108 


B 


142 


153 


* 


00 


E 


48 


E 


118 


E 


He 


E 


128 


m 


g 


E 




He 


B 


He 


T 


146 


48 


78 


78 


185 


85 




189 


165 


* 


He 


E 


78 


85 






E 


123 


128 


128 




He 


156 


He 


161 




U 


70 


48 


48 


48 


m 


El 


B 


* 




He 


* 




E 


182 


E 




He 


149 


m 


108 


He 


E 


He 


He 


He 


+ 


V 


78 


* 


* 


* 


176 


* 


* 


A 


108 


He 


* 


D 


He 


H« 


E 


He 


He 


* 


He 


m 


* 


He 


* 


He 


He 


He 


W 


i 


* 


* 


* 




* 


* 


E 


111 


* 


H« 


E 


He 


E 


128 


He 


He 


E 


m 


He 


He 


He 


He 


He 




A 


X 




* 




IS 




B 


* 


| 




* 


* 


* 


* 


E 




E 


* 


E 


E 


85 


He 


He 


He 


* 


He 


He 


Y 




m 






95 


104 


E 


E 


48 


He 


He 




E 


78 


E 


48 


He 


E 




118 




* 


E 


He 


He 


He 


Z 


00 


* 




* 


30 


* 




* 




H» 


H< 


H« 


He 


He 


He 


H- 


H= 


H 


H-- 


He 


H 


He 


He 


+ 


He 


He 



*In computations, assign a value of -100 as the log for these digraphs These combinations do not usually 
occur in 6,000 digraphs Do not assign "0” to these combinations as that is the logarithmic value for a fre- 
quency of one, and these combinations have a frequency of less than one 
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Table 15 — Relative logarithmic values { Log„ 222) of frequencies of English digraphs * 



[Based on n count of 6,000 digraphs] 

Second Letter 



ABCDEFGHIJKLMNOPQRSTUVWXYZ 




♦See pages 11-12 for details 
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SPECIAL-PURPOSE DATA 



Table l6~A . — Frequency distribution of digraphs, based on 64,365 
letters of decrypted U. S, Government messages in which Z was used as 
a word-sepai ator and X was used for both X-n and Zn* 

U+f 

ABCDEFGHIJKLMNOPQRSTUVWXY 



Z 




In the text which gave rise to this and the following two tables, the 
f re quent ly -us ed punctuation signs "comma" and "period" were aobreviated 
as CM and FD, respectively, and the procedure term "repeat" was abbre- 
viated as RED j thus, the digraphs Cll, PD, ED, and RP, which usuellly do 
not occur frequently (see Table 6-A), are of relatively high frequency 
here. 
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Tabic l6-B» — Frcciuency cl Lstr ibution of digraphs, ha^ed on the 
text used for Table l6-A, from vhich the Z wore] -c> operator has been 
omitted (total* 53/866 letters) ♦ 

2.*' L+v 





A 


B 


C 


D 


E 


F 


G 


II 


I 


J 


K 


L 


M 


H 


0 


P 


Q 


R 


s 


T 


U 


V 


w 


X 


Y 


Z 


A 


7? 


ns 


110 


164 


40 


136 


III 


26 


181 


11 


52 


227 


166 


431 


S8 


147 


3 


6 SI 


611 


315 


65 


58 


4o 


23 


67 




B 


63 


»♦ 


1 


z 


113 


5 




1 


43 


t4 

K* 




W 


6 


18 


62 


Z 




u 


11 


13 


IS 


2 


3 


3 


6o 




G 


133 


i 


31 


2.0 


043 


32 


21 


184 


HI 




48 


18 


313 


II 


414 


8 


Z 


18 


71 


180 


41 


1 


6 


f 


27 




D 


413 


u 


IOZ 


7+ 


307 


86 


26 


13 


173 


7 


S 


23 


31 


22 


ISI 


11 


16 


142 


118 


153 


SI 


40 


ss 


2 


1 8 




E 


211 


10 


381 


481 


CIO 


283 


48 


37 


326 


21 


12 


201 


tlo 


8 55 


181 


278 


13 


131 


476 


367 


S3 


2IS 


87 


136 


3f 




F 


Co 


14 


41 


2S 


101 


137 


7 


Z 


380 


3 


1 


31 


IS 


10 


581 


80 


1 


14? 


S6 


67 


41 


3 


1 


3 


7 




G 


IOZ 


31 


zo 


SI 


264 


<1 


31 


261 


37 


S' 


2 


12 


10 


41 


4S 


38 


4 


41 


53 


38 


31 


2 


3 


"7 


1 




H 


no 


? 


34 


28 


215 


S4 


13 


31 


210 


14 


It 


8 


13 


34 


131 


If 


23 


231 


64 


3IS 


18 


3 


(6 


S 


3 




I 


@3 


B 


S3 


□ 




m 


330 


8 


s ■ 


1 


n 






70S 


?,ol 


Bf 


1 


Q 




21? 


2 


270 


3 


ES 


B 




J 


ffi 


■ 


7 


2 


31 


Q 




i 


21 


zo 


■ 


Q 


a 


S 




B 


B 


6 


2 


11 


1 


2 


2 




5 




K 


31 




1 


m 


108 


IO 


2 




SC 


3 


m 


D 


a 


13 


□ 


B 


i 


6 


II 


S 


2 


1 


4 


a 


□ 




L 


CD 




38 


61 


338 


47 


1 


13 


2o7 


a 


4 


143 


m 


68 




11 




21 




So 


a 


ff 


If 


i 






M 


SIS 


EB 




□ 


206 


22 


C4 


4 


16 


D 


1 


6 


9 


□ 


EJ 


123 


3 


26 


□ 


40 


22 


2 


lo 


is 


n 




N 


SI 


□ 




33C 


748 


131 


1S4 


11 


263 


c 


11 


E9 


47 


EB 


Q] 


12 


□ 


66 


^2 


3S1 


15 


23 


28 


28 




■ 


0 


EH 


E 


HI 


13 


9 


4X6 


12 


47 


37 


13 


27 


172 


252 


110 


ii 


111 


X 


473 


204 


214 


411 


SI 


68 


170 


17 


i 


P 


an 


□ 


1C 


5fj 


g 


5 


3 


40 


21 


4 




76 


44 


ii 


150 


HI 


3 


171 


m 


56S 


44 


2 


2 


1 


S' 




Q 


D 


f 


3 




7 


Z 




4 


5 




l 


2 


5 


ii 


S 


z 


1 


lo 


IO 


2 


117 


B 


3 


1 






R 


SI 


a 


131 


lie 


loll 


84 


66 


14 


207 


11 


40 


61 


142 


SI 


631 


361 


8 


103 


266 


263 


67 


□ 


21 


^0 


7f 




S 




31 


143 


31 


316 


141 


SS 


4S3 


361 


5 


11 




□ 


34 


173 


111 


16 


62 


178 


585 


I'M 


14 


3f 


2 


43 




T 




30 


147 


7o 


400 


17 


21 


six 


308 


14 


IC 


3 


9 


loo 


463 


IS 


B 


IW 


0 


282 


m 


12 


33? 


3o 


57 




U 


m 




33 


Cl 


S« 


7 


61 


2 


36 


4 


4 




a 


17 


35 


40 




4S4 


EH 


BS 


6 


S 


6 


6 


l 




V 


Q 




13 


5“ 


411 


1 


1 




BS 




2 


<4 


a 


B 


21 


6 


B 


B 


B 


D 


i 


1 


l 


i 


7 




W 


DS 


D 


a 


1 


31 


2 


12 


3S 


le>7 


3 


1 


10 


i 


\1r 


367 


• 0 


2. 


3 


H 


6 


i 




13 


13 


f 




X 




Q 


□ 


Q 


361 


20 




4 


12 


3 


lo 


z 


i 


ii 


24 


41 


3 


26 


21 


47 


4 


1 




S4 


3 




X 


m 


D 


ES 


□ 


14 


33 


18 


S 


22 


1 


1 


7 


22 


zs 


14 


71 


1 


31 


36 


35 


10 


1 


18 




l» 


~1 


Z 
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Table 16 -C . — The 53 digraphs from Table 6 - A which comprise 50$ of 
the total, arranged according to frequencies reduced to a base of 5*000 
digraphs, shown with the corresponding frequencies of the same digraphs 
from Table l 6 -B (also reduced bo a base of 5*000 ) „ 1 



Dig. 


6 -a 


16 -B 


Dig. 


6 -A 


16 -B 


EN 


111 


79 


FO 


40 


54 


RE 


98 


94 


FI 


39 


35 


ER 


87 


86 


RA 


39 


28 


HT 


82 


33 


ET 


37 


34 


TH 


78 


55 


LE 


37 


31 


ON 


77 


84 


OU 


37 


39 


nr 


75 


65 


MA 


36 


55 


TE 


71 


36 


T W 


36 


31 


AH 


64 


4i 


EA 


35 


28 


OR 


64 


44 


IS 


35 


28 


ST 


63 


36 


SI 


34 


34 


ED 


60 


45 


DE 


33 


29 


HE 


57 


69 


HI 


33 


20 1 


VE 


57 


46 


AL 


32 


21 1 


ES 


54 


44 


CE 


32 


24 


HD 


52 


31 


DA 


32 


41 


TO 


50 


43 


EC 


32 


36 


SE 


49 


37 


RS 


31 


25 


AT 


47 


37 


UR 


31 


42 


TI 


45 


29 


HI 


30 


24 


AR 


44 


61 


RI 


30 


I 9 J- 


EE 


42 


64 


EL 


29 


19 1 


RT 


42 


24 


HT 


28 


29, 


AS 


4i 


57 


LA 


28 


18 1 


CO 


4i 


39, 


RO 


28 


59 


10 


4l 


19“ 


TA 


28 


26 


TY 


4l 


5 1 









With the exception of AL, EL, HI, 10, LA, HI, TY, the digraphs of 
this table are all from among the 65 digraphs from Table 16 -B which 
comprise 50 $ of the total. 
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A. LIST OF WORDS USED IN MILITARY TEXT ARRANGED ALPHABETICALLY 

ACCORDING TO WORD LENGTH 



TWO LETTER WORDS 



AM 


BY 


EM 


IN 


MM 


OK 


TO 


AN 


CO 


GO 


IS 


MP 


ON 


US 


AS 


CP 


HE 


IT 


MY 


OR 


WD 


AT 


CQ 


HQ 


MC 


NO 


QM 


WE 


BE 

BN 


DO 


IF 


ME 


OF 


SO 


WO 



THREE LETTER WORDS 



ACT 


BID 


DUN 


HAS 


MIX 


PVT 


TEN 


ADD 


BIG 


EAT 


HER 


NAN 


QMC 


THE 


ADJ 


BOX 


END 


HIM 


NET 


RED 


TIN 


AGE 


BUT 


EYE 


HIS 


NEW 


RID 


TON 


AGO 


BUY 


FAR 


HOW 


NOT 


ROB 


TOO 


AID 


CAM 


FEW 


ILL 


NOW 


RUN 


TOP 


AIM 


CAN 


FIT 


ITS 


OFF 


SAW 


TRY 


AIR 


CAR 


FIX 


JIG 


OLD 


SAY 


TUB 


ALL 


CAV 


FOR 


JOB 


ONE 


SEA 


TWO 


AND 


COL 


FOX 


KEG 


OUR 


SEE 


USE 


ANY 


CPL 


GAL 


LAW 


OUT 


SET 


VAT 


APT 


CUT 


GAS 


LAY 


OWE 


SGT 


WAR 


ARC 


CWT 


GEN 


LET 


OWN 


SHE 


WAS 


ARE 


DAY 


GET 


LOT 


PAR 


SIX 


WAY 


ARM 


DID 


GHQ 


LOW 


PAY 


SPY 


WET 


ASK 


DIE 


GOT 


MAJ 


PEN 


SUM 


WGT 


BAD 


DOG 


GUN 


MAN 


PER 


SUN 


WON 


BAG 


DRY 


HAP 


MAT 


PIN 


TAN 


YET 


BAR 


DUE 


HAM 


“ MEN 


PUT 


TAX 


YOU 








FOUR LETTER 


WORDS 






ABLE 


BOTH 


EACH 


FLEE 


HIGH 


LATE 


MAIN 


AIDE 


BULB 


EAST 


FORM 


HILL 


LEAD 


MANY 


ALLY 


BULK 


EASY 


FOUR 


HITS 


LEAK 


MASK 


ALSO 


CALL 


EDGE 


FROM 


HOLD 


LEFT 


MASS 


AREA 


CELL 


EYES 


FULL 


HOOK 


LESS 


MEAT 


ARMY 


CITY 


FALL 


FUSE 


INTO 


LIEU 


MEET 


ASIA 


CODE 


FARM 


FUZE 


ITEM 


LINE 


MESS 


AWAY 


COOK 


FAST 


GUNS 


JOIN 


LIST 


MIKE 


AXIS 


DARK 


FEEL 


HALF 


JULY 


LOAD 


MILE 


BACK 


DASH 


FEET 


HALT 


JUNE 


LONG 


MINE 


BASE 


DATE 


FELL 


HAND 


JUST 


LOOK 


MORE 


BEEN 


DAYS 


FILE 


HARD 


KEEP 


LOSS 


MOVE 


BLUE 


DIRT 


FIRE 


HAVE 


KIND 


LOST 


MTCL 


BODY 


DOWN 


FIRM 


HEAD 


KING 


LOVE 


MULE 


BOMB 


DRAW 


FIVE 


HERD 


LAND 


MADE 


NAVY 


BOOK 


DUMP 


FLAG 


HERE 


LAST 


MAIM 


NEAR 
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FOUR LETTER WORDS— Continued 



NEXT 


PARK 


REAR 


SHOT 


TEAM 


TOOK 




WEST 


NINE 


PASS 


RIOT 


SIDE 


TENT 


TOOL 




WHAT 


NOON 


PIPE ’ 


ROAD 


SOME 


TEXT 


TOWN 




WHEN 


NOTE 


PLAN 


ROUT 


SOON 


THAN " " 


TYPE 




WILL 


OBOE 


POST 


RULE 


STOP 


THAT 


UNIT 




WIRE 


OMIT 


PUMP 


RUSH 


SUNK 


THEM 


VARY 




WITH 


ONCE 


Push 


SAID 


TAKE 


THEN 


•VERY 




XRAY 


ONLY 


RAID 


SAME 


TALK 


THEY 


WEAK 




YOKE 


OPEN 


RAIL 


SANK 


TANK 


THIS 


WEEK 


- 


ZERO 


ORAL 


RAIN 


SEEN 


TARE 


TIME 


WELL 




ZONE 


OVER 


RANK 


SHIP 


TASK 


TONS 


WERE 







FIVE LETTER WORDS 




ABOUT 


BOATS 


DECKS 


FIGHT 


LATER 


PRIOR 


SHIPS 


TITLE 


AFTER 


BOMBS 


DEFER 


FIRES 


LEAST 


PROOF 


SHORE 


TODAY 


AGAIN 


BOOTH 


DELAY 


FIRST 


LEAVE 


PROVE 


SIEGE 


TOTAL 


AGENT 


BREAK 


DEPOT 


FLANK 


LEVEL 


QUEEN 


SIGHT 


TRACT 


ALARM 


BRIBE 


DEPTH 


FLARE 


LIGHT 


QUICK 


SIXTH 


TRAIN 


ALERT 


BROKE 


DOCKS 


FLATS 


LIMIT 


QUIET 


SIXTY 


TROOP 


ALIGN 


BURST 


DRAWN 


FLEET 


LOCAL 


RADIO 


SLOPE 


TRUCE 


ALINE 


CANAL 


DRESS 


FOGGY 


MAJOR 


RAFTS 


SMALL 


TRUCK 


ALLOW 


CASES 


DRILL 


FORCE 


MARCH 


RAIDS 


SMOKE 


UNDER 


ALONG 


CAUSE 


DRIVE 


FORTY 


METER 


RALLY 


SOUTH 


UNION 


AMONG 


CEASE 


EAGER 


Fresh 


MILES 


RANGE 


SPEED 


UNITS 


ANNEX 


CHECK 


EARLY 


front 


MOTOR 


RAPID 


SPELL 


USUAL 


APPLY 


CHIEF 


EIGHT 


GATES 


NAVAL 


REACH 


SPLIT 


VALOR 


APRIL 


CLEAR 


ENEMY 


GAUGE 


NIGHT 


READY 


SQUAD 


VISIT 


AREAS 


CLERK 


ENTER 


GIVEN 


NINTH 


REFER 


STAFF 


VITAL 


ARMOR 


CLOSE 


EQUAL 


GOING 


NORTH 


REPEL 


STAKE 


VOCAL 


ASSET 


COAST 


EQUIP 


GROUP 


ORDER 


RIDGE 


START 


VOICE 


AWAIT 


COLON 


ERASE 


GUARD 


OTHER 


RIGHT 


X STEEL 


WAGON 


AWARD 


COMMA 


ERROR 


GUEST 


PACKS 


RIGID 


SUGAR 


WEIGH 


BAKER 


CORPS 


ETHER 


HEAVY 


PAIRS 


RIVER 


TAKEN 


WHEEL 


BANKS 


COUNT 


EVERY 


HONOR 


PARTY 


ROGER 


TANKS 


WHERE 


BARGE 


COVER 


FATAL 


Horse 


PETER 


ROUTE 


TENTH 


WHICH 


BEACH 


CREEK 


FEARS 


HOURS 


PLACE 


SCALE 


THEIR 


WIDTH 


BEGIN 


CREST 


FERRY 


HOUSE 


PLAIN 


SEIZE 


THERE 


WIPED 


BEING 


CROSS 


FIELD 


ISSUE 


PLANS 


SEVEN 


THESE 


WOODS 


BLACK 


CURVE 


FIFTH 


JAPAN 


POINT 


SHELL 


THIRD 


YARDS 


BLIND 


DAILY 


FIFTY 


LARGE 


PRESS 


SHIFT 


THREE 


ZEBRA 
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SIX LETTER WORDS 



ACCEPT 


BOMBED 


DEGREE 


FIERCE 


LESSON 


OTHERS 


RESUME 


SUFFER 


ACCESS 


BOMBER 


DEPART 


FILING 


LETTER 


OUTPUT 


RETIRE 


SUMMER 


ACROSS 


BOTTOM 


DEPEND 


FINISH 


LINING 


PANAMA 


RETURN 


SUMMIT 


ACTION 


BRANCH 


DEPLOY 


FIRING 


LIQUID 


PARADE 


REVIEW 


SUMMON 


ACTIVE 


BREACH 


DESERT 


FLIGHT 


LITTER 


PARLEY 


RIDING 


SUNDAY 


ADJUST 


BREEZE 


DETACH 


FLYING 


LITTLE 


PASSED 


ROCKET 


SUNKEN 


ADVICE 


BRIDGE 


DETAIL 


FOLLOW 


LOCATE 


PASSES 


ROUTED 


SUNSET 


ADVISE 


BROKEN 


DEVICE 


FORCES 


LOSSES 


PATROL 


ROUTES 


SUPPLY 


AFFAIR 


BUREAU 


DEVISE 


FORMAL 


MANAGE 


PERIOD 


RUBBER 


SURVEY 


ALASKA 


CANADA 


DIRECT 


FORMrD 


MANNER 


PICKET 


RUNNER 


SWITCH 


ALLEGE 


CANCEL 


DIVERT 


FOUGHT 


MANUAL 


PINCER 


SALARY 


SYSTEM 


ALLIED 


CANNOT 


DIVIDE 


FOURTH 


MEAGER 


PISTOL 


SCHEME 


TABLES 


ALLIES 


CANVAS 


DOCTOR 


FRIDAY 


MEDIUM 


PLACES 


SCHOOL 


TANKER 


ALWAYS 


CASUAL 


DOLLAR 


FUTURE 


MEMBER 


PLANES 


SCORED 


TARGET 


ANIMAL 


CAUSED 


DOWNED 


GARAGE 


METHOD 


POINTS 


SCREEN 


TATTOO 


ANNUAL 


CENTER 


DRYRUN 


GEORGE 


METRIC 


POISON 


SEAMAN 


TERROR 


ANYWAY 


CHANGE 


DUGOUT 


GREASE 


MINING 


POLICE 


SEAMEN 


THIRTY 


APPEAR 


CHARGE 


DURING 


GROUND 


MINUTE 


PONTON 


SEARCH 


JIfQUGH 


ARABIA 


CHEESE 


EFFECT 


GUNNER 


MIRROR 


POSTAL 


SECOND 


"I ■REAT 


ARMIES 


CHURCH 


EFFORT 


HALTED 


MOBILE 


PREFER 


SECTOR 


'I RAINS 


ARMORY 


CIPHER 


EIGHTH 


HAMMER 


MONDAY 


PROMPT 


SECURE 


TRENCH 


ARREST 


CIRCLE 


EIGHTY 


HAPPEN 


MORALE 


PPOPER 


SELECT 


TROOPS 


ARRIVE 


COFFEE 


EITHER 


HARBOR 


MORTAR 


PURSUE 


SERIAL 


TURRET 


ASSETS 


COLORS 


ELEVEN 


HELPER 


MOVING 


RADIAL 


SETTLE 


TWELVE 


ASSIST 


COLUMN 


EMBARK 


HIGHER 


MURDER 


RAIDED 


SEVERE 


TWENTY 


ASSURE 


COMBAT 


EMPLOY 


HOURLY 


MUZZLE 


RATION 


SHELLS 


UNABLE 


ATTACH 


COMMIT 


ENCODE 


INDEED 


NAUGHT 


RAVINE 


SIGCOM 


UNITED 


ATTACK 


COMMON 


ENGAGE 


INFORM 


NEARER 


RECORD 


SIGNAL 


UNLESS 


ATTAIN 


CONVEY 


ENGINE 


INLAND 


NINETY 


REDUCE 


SINGLE 


VALLEY 


AUGUST 


CONVOY 


ENROLL 


INTEND 


NORMAL 


REFILL 


SLIGHT 


VERBAL 


BANNER 


COURSE 


ENTIRE 


INTENT 


NOTING 


REFUGE 


SPHERE 


VERIFY 


BARBED 


CREDIT 


ERASER 


INVENT 


NOUGHT 


REFUSE 


SPOOLS 


VESSEL 


BARGES 


CRISIS 


ESCORT 


ISLAND 


NOVICE 


REJECT 


SPOONS 


VICTIM 


BATTEN 


CRITIC 


EUROPE 


ISSUES 


NOZZLE 


RELIEF 


STATES 


VICTOR 


BATTLE 


DAMAGE 


EXCEPT 


KEEPER 


NUMBER 


REMAIN 


STATUS 


VISITS 


BEETLE 


DEBARK 


EXCESS 


KILLED 


OCCUPY 


REMEDY 


STRAFE 


VISUAL 


BEFORE 


DECIDE 


EXCITE 


LADDER 


OFFEND 


REPAIR 


STREET 


WEIGHT 


BETTER 


DECODE 


EXPECT 


LANDED 


OFFICE 


REPORT 


STRESS 


WIRING 


BEYOND 


DECREE 


EXPELS 


LAUNCH 


OPPOSE 


RESCUE 


STRIPS 


WITHIN 


BILLET 


DEFEAT 


EXPEND 


LEADER 


ORDERS 


RESIST 


SUBMIT 


WOODED 


BITTER 


DEFECT 


EXTEND 


LEAGUE 


ORIENT 


RESULT 


SUDDEN 


ZIGZAG 


BODIES 


DEFEND 


EXTENT 













SEVEN LETTER WORDS 



ABANDON 


ALMANAC 


APPOINT 


ASIATIC 


AVIATOR 


BATTERY 


BETWEEN 


ABSENCE 


AMMETER 


APPROVE 


ASSAULT 


AWKWARD 


BATTLES 


BICYCLE 


ADDRESS 


ANALYZE 


ARMORED 


ATTACKS 


BAGGAGE 


BEARING 


BINDING 


ADVANCE 


ANOTHER 


ARRANGE 


ATTEMPT 


BALLOON 


BECAUSE 


BIVOUAC 


AGAINST 


ANTENNA 


ARRIVAL 


AVERAGE 


BARRAGE 


BEDDING 


BOMBARD 
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RESTRICTED 



SEVEN LETTER WORDS— Continued 



BOMBERS 


DEBOUCH 


FITTING 


LANDING 


PACKAGE 


REQUEST 


SUPPOSE 


BOMBING 


DECIDED 


FOGHORN 


LEADING 


PASSAGE 


REQUIRE 


SURPLUS 


BOYCOTT 


DECLARE 


FORCING 


LECTURE 


PASSIVE 


RESERVE 


SUSPEND 


BRIBERY 


DECODED 


FORGING 


LIAISON 


PATROLS 


RESPECT 


TACTICS 


BRIGADE 


DEFENSE 


FORWARD 


I IBRARY 


PAYROLL 


RESPOND 


TALKING 


CALIBER 


DELAYED 


FOXHOLE 


LICENSE 


PLACING 


RFTIRED 


TARGETS 


CALIBRE 


DELIVER 


FUELOIL 


LIFTING 


PLATdbN 


RETREAT 


TERRAIN 


CAPTAIN 


DERRICK 


FURNISH 


LbADlNG 


POUNDER 


REVENUE 


THATTHE 


CAPTIVE 


DESTROY 


FURTHER 


LOGICAL 


PRAIRIE 


REVERSE 


THROUGH 


CARRIER 


DETRAIN 


GASSING 


LOOKOUT 


PRECEDE 


REVOLVE 


TOBACCO 


CAVALRY 


DETRUCK 


GENERAL 


MACHINE 


PREPARE 


ROUTINE 


TONIGHT 


CENTRAL 


DEVELOP 


GETTING 


MANDATE 


PRESENT 


RUNNING 


TONNAGE 


CHANGES 


DIAGRAM 


GLASSES 


MANNING 


PRESSED 


SAILORS 


TORPEDO 


CHANNEL 


DISCUSS 


GRADUAL 


MAPPING 


PRIMARY 


SATISFY 


TRACTOR 


CHARLIE 


DISEASE 


GRENADE 


MARCHED 


PROCEED 


SECRECY 


TRAFFIC 


CHASSIS 


DISMISS 


GUARDED 


MARSHAL 


PROGRAM 


SECTION 


TRAWLER 


CIRCUIT 


DISTILL 


HALTING 


MARTIAL 


PROMOTE 


SECTORS 


TRIGGER 


CLIPPER 


DROPPED 


HASBEEN 


MAXIMUM 


PROPOSE 


SERVICE 


TUESDAY 


COASTAL 


EASTERN 


HEADING 


MEDICAL 


PROTECT 


SESSION 


TWELFTH 


COLLECT 


ECHELON 


HEAVIER 


MESSAGE 


PROTEST 


SETBACK 


UNKNOWN 


COLLEGE 


ELEMENT 


HIGHEST 


MESSING 


PROVOST 


SEVENTH 


UNUSUAL 


COLONEL 


ELEVATE 


HOLDING 


MILITIA 


PURPOSE 


SEVENTY 


USELESS 


COMMAND 


EMBASSY 


HORIZON 


MINIMUM 


PURSUIT 


SEVERAL 


UTILITY 


COMMEND 


ENCODED 


HOSTILE 


MISFIRE 


PUSHING 


SHELLED 


VACANCY 


COMMENT 


ENEMIES 


HUNDRED 


MISSING 


QUARTER 


SHORTLY 


VARYING 


COMMUTE 


ENFORCE 


ICEBERG 


MISSION 


QUICKLY 


SIGNIFY 


VESSELS 


COMPANY 


ENGAGED 


ILLEGAL 


MORNING 


RADIATE 


SIMILAR 


VICTORY 


COMPASS 


ENTENTE 


ILLNESS 


NATURAL 


RAIDING 


SIMPLEX 


VILLAGE 


CONCEAL 


ENTRAIN 


INCLUDE 


NEAREST 


RAILWAY 


SINKING 


VISIBLE 


CONDEMN 


ENTRUCK 


INFLICT 


NIGHTLY 


RAINING 


SIXTEEN 


VISITOR 


CONDUCT 


ENVELOP 


INITIAL 


NOTHING 


RAPIDLY 


SLOPING 


WARFARE 


CONFINE 


EXCLUDE 


INQUIRE 


NUMBERS 


REACHED 


SMOKING 


WARSHIP 


CONTACT 


EXPLAIN 


INQUIRY 


OBSERVE 


RECEIPT 


SOLDIER 


WEATHER 


CONTAIN 


EXPRESS 


INSPIRE 


OCTOBER 


RECEIVE 


STARTER 


WESTERN 


CONTROL 


EXTRACT 


INSTALL 


OFFENSE 


RECOVER 


STATION 


WHETHER 


CORRECT 


EXTREME 


INSTANT 


OFFICER 


RECRUIT 


STEAMER 


WILLIAM 


COUNCIL 


FALLING 


INVADED 


OMITTED 


REDUCED 


STOPPED 


WINDAGE 


COURIER 


FARTHER 


ISLANDS 


OPERATE 


REFUGEE 


STORAGE 


WITHOUT 


COVERED 


FEDERAL 


ISSUING 


OPINION 


REGULAR 


SUCCESS 


WITHTHE 


CROSSED 


FIFTEEN 


JANUARY 


ORDERED 


RELEASE 


SUGGEST 


WITNESS 


CRUISER 


FIGHTER 


JUMPOFF 


OUTPOST 


RELIEVE 


SUMMARY 


WOUNDED 


CURRENT/ 


FILLING 


KITCHEN 


OUTSIDE 


JREPAIRS 


SUNRISE 


WRECKED 


CYCLONE 


FINDING 


KILLING 


PACIFIC 


REPLACE 


SUPPORT 


WRITTEN 


DAMAGED 


FISHING 
















EIGHT 


LETTER WORDS 






ACTIVITY 


ADVANCED 


AIRBORNE 


AIRPLANE 


ANNOUNCE 


APPROACH 


ASSEMBLE 


ACTUALLY 


ADVANCES 


AIRCRAFT 


ALTITUDE 


ANTITANK 


APPROVAL 


ASSEMBLY 


Adjacent 


ADVISING 


AIRDROME 


AMERICAN 


APPARENT 


ARMAMENT 


ASSIGNED 


ADJUTANT 


ADVISORY 


AIRFIELD 


ANALYSIS 


APPEARED 


ARRESTED 


ASSOONAS 
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EIGHT LETTER WORDS— Continued 



ATLANTIC 


CRITIQUE 


DRIFTING 


FORENOON 


MEDICINE 


PRIORITY 


SERGEANT 


ATTACKED 


CROSSING 


EASTERLY 


FORTRESS 


MEMORIAL 


PRISONER 


SHELLING 


ATTEMPTS 


CRUISERS 


EASTWARD 


FOURTEEN 


MERCIFUL 


PROBABLE 


SHIPPING 


AVIATION 


DAMAGING 


ECONOMIC 


FRONTAGE 


MESSAGES 


PROBABLY 


SIGHTING 


BARRACKS 


DARKNESS 


EFFECTED 


FUSELAGE 


MIDNIGHT 


PROGRESS 


SKIRMISH 


BARRAGES 


DAYLIGHT 


EFFICACY 


GARRISON 


MILITARY 


PROHIBIT 


SOLDIERS 


BATTERED 


DECEMBER 


EIGHTEEN 


GROUNDED 


MISFIRES 


PROTESTS 


SOUTHERN 


BATTLING 


DECIPHER 


ELEMENTS 


GROUPING 


MISSIONS 


PROTOCOL 


SPECIFIC 


BESEIGED 


DECISION 


ELEVENTH 


GUARDING 


MOBILIZE 


PURPOSES 


SPOTTING 


BILLETED 


DECISIVE 


ELIGIBLE 


HAVEBEEN 


MONOPOLY 


QUARTERS 


SQUADRON 


BOUNDARY 


DECLARED 


EMPLOYEE 


HINDERED 


MOUNTAIN 


RAILHEAD 


STANDARD 


BREAKING 


DECREASE 


EMPLOYER 


HOSPITAL 


MOVEMENT 


RAILROAD 


STATIONS 


BUILDING 


DEDICATE 


ENCIPHER 


HOWITZER 


NATIONAL 


RALLYING 


STRATEGY 


BULLETIN 


DEFEATED 


ENCIRCLE 


IDENTIFY 


jNAUTICAL 


RECEIVER 


SUFFERED 


BUSINESS 


DEFENDED 


ENFILADE 


IGNITION 


NINETEEN 


RECORDER 


SUITABLE 


CALAMITY 


DEFENDER 


ENGAGING 


IMPROPER 


'northern 


REDCROSS 


SUPERIOR 


CAMPAIGN 


DEFENSES 


ENGINEER 


IMPROVED 


NOVEMBER 


REENLIST 


SUPPLIES 


CANISTER 


DEFERRED 


ENLISTED 


INCIDENT 


{OBSERVED 


REGIMENT 


SURPRISE 


CAPACITY 


DEFINITE 


ENORMOUS 


INDICATE 


OBSERVER 


REGISTER 


SURROUND 


CAPTURED 


DELAYING 


ENROLLED 


INDIRECT 


{obsolete 


REJECTED 


SURVIVED 


CARELESS 


DEMANDED 


ENTERING 


INFANTRY 


■OBSTACLE 


REJECTOR 


SUSPENSE 


CARRIAGE 


DEPARTED 


ENTRENCH 


INFECTED 


OCCUPIED 


REMEDIES 


SWEEPING 


CARRIERS 


DEPLOYED 


ENVELOPE 


INITIATE 


OFFENDED 


REMEMBER 


SWIMMING 


CARRYING 


DEPORTED 


EQUALIZE 


INSECURE 


OFFICERS 


REPAIRED 


TACTICAL 


CASUALTY 


DESCRIBE 


EQUIPAGE 


INSIGNIA 


OFFICIAL 


REPEATER 


TAXATION 


CAUSEWAY 


DESERTED 


ESCORTED 


INSTRUCT 


OPERATOR 


REPELLED 


TELEGRAM 


CEMETERY 


DESERTER 


ESTIMATE 


INTEREST 


OPPOSING 


REPLACED 


TERRIBLE 


CENTERED 


DESPATCH 


EUROPEAN 


INTERIOR 


OPPOSITE 


REPORTED 


TERRIFIC 


CHAPLAIN 


DETACHED 


EVACUATE 


INTERNAL 


ORDINATE 


REPULSED 


THATHAVE 


CHEMICAL 


DETECTOR 


EXCAVATE 


INTRENCH 


ORDNANCE 


REQUIRED 


THIRTEEN 


CIRCULAR 


DETONATE 


EXCHANGE 


INVADING 


OUTBOARD 


RESEARCH 


THOUSAND 


CITATION 


DEVELOPE 


EXERCISE 


INVASION 


OUTGUARD 


RESERVES 


THURSDAY 


CIVILIAN 


DICTATED 


EXPANDED 


INVENTED 


tlUTPOSTS 


RESPECTS 


TOMORROW 


CLERICAL 


DICTATOR 


EXPEDITE 


JETPLANE 


PAINTING 


RESTORED 


TOTALING 


CODEBOOK 


DIMINISH 


EXPELLED 


JUNCTION 


PARALLAX 


RETIRING 


TRAILERS 


COMMANDS 


DIRECTOR 


EXPENDED 


LANGUAGE 


PARALLEL 


RETURNED 


TRAINING 


COMMENCE 


DISARMED 


EXPENSES 


LATITUDE 


PASSPORT 


REVIEWED 


TRANSFER 


COMMERCE 


DISASTER 


EXTENDED 


LETTERED 


PLANNING 


REVOLVER 


TRAVERSE 


COMPLETE 


DISLODGE 


EXTERIOR 


LIMITING 


POLITICS 


RIGOROUS 


TRAWLERS 


COMPOSED 


DISPATCH 


FACTIONS 


LOCATION 


PONTOONS 


SABOTAGE 


VEHICLES 


CONCLUDE 


DISPERSE 


FATALITY 


LUMINOUS 


POSITION 


SANITARY 


VICINITY 


CONCRETE 


DISTANCE 


FEBRUARY 


MAINTAIN 


POSITIVE 


SATURDAY 


VIGOROUS 


CONFLICT 


DISTRESS 


FERRYING 


MANDATED 


POSSIBLE 


SCHEDULE 


WARSHIPS 


CONGRESS 


DISTRICT 


FIGHTERS 


MANEUVER 


PpSTPONE 


SEABORNE 


WESTERLY 


CONTINUE 


DIVIDING 


FIGHTING 


MARCHING 


PREPARED 


SEALEVEL 


WESTWARD 


CONTRACT 


DIVISION 


FINISHED 


MARITIME 


PkESERVE 


SELECTED 


WINDWARD 


CORPORAL 


DOCTRINE 


FLANKING 


MATERIAL 


PRESSING 


SENTENCE 


WIRELESS 


CORRIDOR 


DOMINANT 


FLEXIBLE 


MATERIEL 


PRESSURE 


SENTINEL 


WITHDRAW 


COVERING 


DRESSING 


FOOTHOLD 


MECHANIC 


PRINTING 


SEPARATE 


WITHDREW 
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ACCESSORY 

ACCOMPANY 

ACCORDING 

ADDRESSED 

ADDRESSES 

ADMISSION 

ADVANCING 

ADVANTAGE 

AERODROME 

AEROPLANE 

AFTERNOON 

AGREEMENT 

AIRDROMES 

AIRPLANES 

ALLOTMENT 

ALLOWANCE 

ALTERNATE 

AMBULANCE 

AMUSEMENT 

ANNOUNCED 

ANONYMOUS 

APPARATUS 

APPOINTED 

ARBITRARY 

ARTILLERY 

ASCENSION 

ASSAULTED 

ASSISTANT 

ASSOCIATE 

ASSURANCE 

ATTACKING 

ATTEMPTED 

ATTENTION 

AUTOMATIC 

AVAILABLE 

BALLISTIC 

BAROMETER 

BATTALION 

BATTERIES 

BEACHHEAD 

BEGINNING 

BLOCKADED 

BOMBARDED 

BRIGADIER 

BUILDINGS 

CABLEGRAM 

CAMPAIGNS 

CANCELLED 

CARTRIDGE 



CENTERING 
CHALLENGE 
CHARACTER 
CHAUFFEUR 
CHRONICAL 
CIGARETTE 
CIRCULATE 
CIVILIANS 
CLEARANCE 
COALITION 
COLLAPSED 
COLLISION 
COMBATANT 
COMMANDED 
COMMANDER 
COMMITTEE 
COMPANIES 
COMPELLED 
COMPLETED 
CONDEMNED 
CONDENSED 
CONDITION 
CONFERRED 
CONFIDENT 
CONFLICTS 
CONQUERED 
CONTINUAL 
- CONTINUED 
CONTINUES 
COOPERATE 
CORRECTED 
CRITICISE 
CRITICISM 
DEBARKING 
DECREASED 
DEFECTIVE 
DEFENSIVE 
DEFICIENT 
DEPARTURE 
DEPENDENT 
DESCRIBED 
DESIGNATE 
DESTITUTE 
DESTROYED 
DESTROYER 
DETENTION 
DETERMINE 
DETONATED 
DETRAINED 



DEVELOPED 

DIETITIAN 

DIFFERENT 

DIFFICULT 

DIMENSION 

DIRECTION 

DIRIGIBLE 

DISAPPEAR 

DISCUSSED 

DISINFECT 

DISMISSAL 

DISPERSED 

DISTRICTS 

DIVISIONS 

DOMINANCE 

DOMINATED 

ECHELONED 

EFFECTIVE 

EFFICIENT 

ELABORATE 

ELEVATION 

ELSEWHERE 

EMBASSIES 

EMERGENCY 

EMPLOYING 

ENDURANCE 

ENGINEERS 

ENLISTING 

ENTRAINED 

EQUIPMENT 

ESTABLISH 

ESTIMATED 

ESTIMATES 

EXCESSIVE 

EXCLUSION 

EXCLUSIVE 

EXECUTIVE 

EXERCISES 

EXHIBITED 

EXPANSION 

EXPANSIVE 

EXPENSIVE 

EXPLOSION 

EXPLOSIVE 

EXTENDING 

EXTENSION 

EXTENSIVE 

FIFTEENTH 

FIREALARM 



FORMATION 

FORTIFIED 

FRONTLINE 

GROUPMENT 

GYROMETER 

HOSTILITY 

HURRICANE 

IDENTICAL 

IMMEDIATE 

IMPORTANT 

IMPRESSED 

INCENTIVE 

INCIDENCE 

INCIDENTS 

INCLINING 

INCLUDING 

INCLUSIVE 

INCREASED 

INDEMNITY 

INDICATED 

INFLATION 

INFLICTED 

INFLUENCE 

INHABITED 

INSTANTLY 

INTEGRITY 

INTENSIVE 

INTENTION 

INTERCEPT 

INTERDICT 

INTERFERE 

INTERMENT 

INTERPOSE 

INTERRUPT 

INTERVENE 

INTERVIEW 

INVENTION 

IRREGULAR 

KILOMETER 

LAUNCHING 

LIABILITY 

LOGISTICS 

LONGITUDE 

MAINTAINS 

MANGANESE 

MECHANISM 

MEMORANDA 

MESSENGER 

MOTORIZED 



MOVEMENTS 

MUNITIONS 

NAVALBASE 

NECESSARY 

NECESSITY 

NEGLIGENT 

NEWSPAPER 

NORTHEAST 

NORTHERLY 

NORTHWARD 

NORTHWEST 

NUMBERING 

OBJECTION 

OBJECTIVE 

OBTAINING 

OCCUPYING 

OFFENSIVE 

OFFICIALS 

OPERATING 

OPERATION 

OSCILLATE 

OUTSKIRTS 

PARACHUTE 

PARAGRAPH 

PARTITION 

PASSENGER 

PATRIOTIC 

PENETRATE 

PERMANENT 

PERSONNEL 

PLACEMENT 

POLITICAL 

POPULATED 

POSITIONS 

PRACTICAL 

PRECEDING 

PREFERRED 

PREMATURE 

PREPARING 

PRESIDENT 

PRINCIPAL 

PRINCIPLE 

PRISONERS 

PROCEDURE 

PROCEEDED 

PROJECTOR 

PROMOTION 

PROPOSALS 

PROTECTED 



PROTECTOR 

PROTESTED 

PROVISION 

PROXIMITY 

RADIATION 

RADIOGRAM 

READINESS 

REARGUARD 

REBELLION 

RECEIVING 

RECOGNIZE 

RECOMMEND 

REENFORCE 

REFERENCE 

REFILLING 

REGARDING 

REINFORCE 

REINSTATE 

REMAINDER 

REMAINING 

REPRESENT 

REPRISALS 

REQUESTED 

REQUIRING 

RESOURCES 

RESTRAINT 

RETENTION 

RETURNING 

REVIEWING 

SCREENING 

SEAPLANES 

SECRETARY 

SEMICOLON 

SEMIRIGID 

SEPTEMBER 

SERIOUSLY 

SERVICING 

SEVENTEEN 

SHELLFIRE 

SITUATION 

SIXTEENTH 

SOUTHEAST 

SOUTHWARD 

SOUTHWEST 

SPEARHEAD 

STANDARDS 

STATEMENT 

STRAGGLER 

STRATEGIC 
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SUBMITTED 

SUCCEEDED 

SURRENDER 

SUSPECTED 



ACCEPTABLE 

ACCEPTANCE 

ACCIDENTAL 

ACCORDANCE 

ACTIVITIES 

ADDITIONAL 

AIRCONTROL 

AIRSUPPORT 

ALLEGIANCE 

ALLOCATION 

AMBASSADOR 

AMMUNITION 

ANTEDATING 

ANTICIPATE 

APPARENTLY 

APPEARANCE 

APPROACHED 

ARMOREDCAR 

ARTIFICIAL 

ASPOSSIBLE 

ASSEMBLIES 

ASSESSMENT 

ASSIGNMENT 

ASSISTANCE 

ATOMICBOMB 

ATTACHMENT 

ATTAINMENT 

ATTEMPTING 

AUDIBILITY 

AUTOMOBILE 

BALLISTICS 

BATTLESHIP 

BEENNEEDED 

BRIDGEHEAD 

CAMOUFLAGE 

CAPABILITY 

CASUALTIES 

CENSORSHIP 

CENTRALIZE 

CIRCUITOUS 

COASTGUARD 

COLLECTING 

COLLECTION 



SUSPENDED 

SUSPICION 

TECHNICAL 

TECHNIQUE 



TELEPHONE THEREFORE UNTENABLE 
TENTATIVE TRANSPORT VARIATION 
TERRITORY TWENTIETH WATERTANK 



COLLISIONS 

COMMANDANT 

COMMANDEER 

COMMANDING 

COMMISSARY 

COMMISSION 

COMMITMENT 

COMMUNIQUE 

COMPENSATE 

COMPLETELY 

COMPRESSED 

CONCERNING 

CONCESSION 

CONCLUSION 

CONDITIONS 

CONFERENCE 

CONFESSION 

CONFIDENCE 

CONNECTING 

CONNECTION 

CONSPIRACY 

CONSTITUTE 

CONTINGENT 

CONTINUOUS 

CONTRABAND 

CONVENIENT 

COORDINATE 

CORRECTION 

CREDENTIAL 

CROSSROADS 

DEBOUCHING 

DECIPHERED 

DECORATION 

DEDICATION 

DEFICIENCY 

DEFINITION 

DEMOBILIZE 

DEPARTMENT 

DEPENDABLE 

DEPLOYMENT 

DEPRESSION 

DESIGNATED 

DESPATCHED 



TEN LETTER WORDS 

DESPATCHES 

DESTROYERS 

DETACHMENT 

DETERMINED 

DETONATION 

DETRAINING 

DETRUCKING 

DIFFERENCE 

DIPLOMATIC 

DIRECTIONS 

DISCIPLINE 

DISCUSSION 

DISPATCHED 

DISPATCHER 

DISPATCHES 

DISPERSION 

DISTRESSED 

DISTRIBUTE 

DIVEBOMBER 

DOMINATION 

EFFICIENCY 

EIGHTEENTH 

ELEMENTARY 

EMPLOYMENT 

ENCIPHERED 

ENCIRCLING 

ENEMYTANKS 

ENGAGEMENT 

ENLISTMENT 

ENROLLMENT 

ENTERPRISE 

ENTRENCHED 

ENTRUCKING 

EQUIVALENT 

ESTIMATION 

EVACUATING 

EVACUATION 

EVALUATION 

EXCAVATION 

EXCITEMENT 

EXHIBITION 

EXPEDITING 

EXPEDITION 



EXPENDABLE 

EXPERIENCE 

EXPERIMENT 

EXPLOSIONS 

EXTINGUISH 

FACILITIES 

FLASHLIGHT 

FORMATIONS 

FOUNDATION 

FOURTEENTH 

FRONTLINES 

GEOGRAPHIC 

GONIOMETER 

GOVERNMENT 

GYROSCOPIC 

HYDROMETER 

HYGROMETER 

ILLITERATE 

ILLUMINATE 

ILLUSTRATE 

IMPASSIBLE 

IMPOSSIBLE 

IMPRESSION 

IMPRESSIVE 

INCENDIARY 

INDICATING 

INDICATION 

INDIVIDUAL 

INFLICTING 

INSECURITY 

INSPECTION 

INSTRUCTED 

INSTRUCTOR 

INSTRUMENT 

INTERNMENT 

INVITATION 

IRRIGATION 

KILOMETERS 

LABORATORY 

LIEUTENANT 

LIMITATION 

LOCOMOTIVE 

MACHINEGUN 



WEDNESDAY 

WITNESSES 

YESTERDAY 



MAINTAINED 

MANAGEMENT 

MECHANIZED 

MEMORANDUM 

MILLIMETER 

MOTORCYCLE 

NATURALIZE 

NAVIGATION 

NEGLIGENCE 

NEWSPAPERS 

NINETEENTH 

OBJECTIVES 

OCCUPATION 

ONEHUNDRED 

OPERATIONS 

OPPOSITION 

OVERCOMING 

PATROLLING 

PERMISSION 

PERSISTENT 

PHOSPHORUS 

POPULATION 

POSSESSION 

POSTOFFICE 

PRECEDENCE 

PREFERENCE 

PRESCRIBED 

PROHIBITED 

PROPORTION 

PROTECTION 

PROVISIONS 

QUARANTINE 

RECEPTACLE 

RECREATION 

RECRUITING 

REENFORCED 

REENLISTED 

REGIMENTAL 

REGULATION 

REINFORCED 

RESISTANCE 

RESPECTFUL 
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REVOLUTION 

SANITATION 

SEPARATION 

SIGNALLING 

SIMILARITY 

STATISTICS 

SUBMARINES 



ACCESSORIES 

AERONAUTICS 

ALTERNATING 

APPLICATION 

APPOINTMENT 

APPROACHING 

APPROPRIATE 

APPROXIMATE 

ARBITRATION 

ARMOREDCARS 

ARRANGEMENT 

ASSESSMENTS 

ASSIGNMENTS 

ASSOCIATION 

BATTLEFIELD 

BATTLESHIPS 

BELLIGERENT 

BLOCKBUSTER 

BOMBARDMENT 

CATASTROPHE 

CERTIFICATE 

CIRCULATION 

COEFFICIENT 

COINCIDENCE 

COMMUNICATE 

COMMUNIQUES 

COMPARTMENT 

COMPETITION 

COMPOSITION 

COMPUTATION 

CONCEALMENT 



ADVANTAGEOUS 

AGRICULTURAL 

ANNOUNCEMENT 

ANTIAIRCRAFT 

ANTICIPATION 

BREAKTHROUGH 

CANCELLATION 



TliN I El 1ER WORDS— Continued 



SUBMISSION 

SUBSTITUTE 

SUCCESSFUL 

SUCCESSIVE 

SUFFICIENT 

SUPPORTING 



SUSPENSION 

SUSPICIONS 

SUSPICIOUS 

THIRTEENTH 

THREATENED 

TRAJECTORY 



TRANSPORTS 

TRANSVERSE 

TROOPSHIPS 

TWENTYFJVE 

UNDERSTAND 

understood 



ELEVEN LETTER WORDS 



CONCENTRATE 

CONFINEMENT 

CONSTITUTED 

CONSUMPTION 

CONTINENTAL 

CONTROVERSY 

COOPERATION 

CORPORATION 

CORRECTNESS 

CREDENTIALS 

CUSTOMHOUSE 

DEBARKATION 

DEMONSTRATE 

DESCRIPTION 

DESCRIPTIVE 

DESIGNATION 

DESTRUCTION 

DETERIORATE 

DEVELOPMENT 

DISAPPEARED 

DISCONTINUE 

DISCREPANCY 

DISINFECTED 

DISPOSITION 

DISTINCTION 

DISTINGUISH 

DYNAMOMETER 

ECHELONMENT 

EFFECTIVELY 

ELECTRICITY 

EMBARKATION 



EMPLACEMENT 

ENCOUNTERED 

ENEMYPLANES 

ENFORCEMENT 

ENGAGEMENTS 

ENGINEERING 

ESTABLISHED 

ESTIMATEDAT 

EXAMINATION 

EXPLANATION 

EXTENSIVELY 

EXTERMINATE ' 

FINGERPRINT , 

FIRECONTROL ! 

HEAVYBOMBER 

HEAVYLOSSES 1 

HOSTILITIES 

IMMEDIATELY 

IMMIGRATION 

IMPEDIMENTA 

IMPROVEMENT 

INCOMPETENT 

INDEPENDENT 

INFLAMMABLE 

INFORMATION 

INSPIRATION 

INSTITUTION 

INSTRUCTION 

INSTRUMENTS 

INTELLIGENT 

INTERCEPTED 



IHTERfcERPTS 

INTERESTING 

INTERFERING 

INTERPRETER 

INTERRUPTED 

INTERVENING 

INVESTIGATE 

LEGISLATION 

LIGHTBOMBER 

MAINTENANCE 

MANUFACTURE 

MEASUREMENT 

NATIONALISM 

NATIONALITY 

NAVALATTACK 

NAVALBATTLE 

NAVALFORCES 

NECESSITATE 

OBSERVATION 

OVERWHELMED 

PARENTHESIS 

PARENTHESES 

PENETRATION 

PERFORMANCE 

PHILIPPINES 

PHOTOGRAPHY 

PREARRANGED 

PREPARATION 

PRELIMINARY 

PROGRESSIVE 

RANGEFINDER 



TWELVE LETTER WORDS 



CARELESSNESS 

COMMENCEMENT 

COMMENDATION 

COMMISSIONED 

COMMISSIONER 

COMPENSATION 

COMPLETENESS 



CONCENTRATED 

CONCILIATION 

CONFIDENTIAL 

CONFIRMATION 

CONFISCATION 

CONFORMATION 

CONSCRIPTION 



CONSIDERABLE 

CONSTITUTING 

CONSTITUTION 

CONSTRUCTION 

CONTINUATION 

CONVALESCENT 

CONVERSATION 
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UNEXPENDED 

UNSUITABLE 

VICTORIOUS 

VISIBILITY 

WILLATTACK 

WITHDRAWAL 



REAPPOINTED 

RECOGNITION 

RECOMMENDED 

RECONNOITER 

REPLACEMENT 

REQUIREMENT 

REQUISITION 

RESERVATION 

RESIGNATION 

RESPONSIBLE 

RESTRICTION 

RETALIATION 

RETROACTIVE 

SCHOOLHOUSE 

SEVENTEENTH 

SEVENTYFIVE 

SIGNIFICANT 

SMOKESCREEN 

STRATEGICAL 

SUBSISTENCE 

SUITABILITY 

SUPERIORITY 

SURRENDERED 

SYNCHRONIZE 

TEMPERATURE 

THERMOMETER 

TOPOGRAPHIC 

TRADITIONAL 

TRANSFERRED 

WITHDRAWING 



COORDINATION 

DECENTRALIZE 

DECIPHERMENT 

DEMONSTRATED 

DEPARTMENTAL 

DIFFICULTIES 

DISORGANIZED 
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DISPLACEMENT 

DISSEMINATED 

DISTRIBUTING 

DISTRIBUTION 

EMPLACEMENTS 

ENCIPHERMENT 

ENTANGLEMENT 

ENTERPRISING 

FIGHTERPLANE 

GENERALALARM 

GENERALSTAFF 

GEOGRAPHICAL 

HEADQUARTERS 

HEAVYBOMBERS 



ACCOMMODATION 

APPROXIMATELY 

CHRONOLOGICAL 

CIRCUMSTANCES 

COMMUNICATION 

CONCENTRATING 

CONCENTRATION 

CONGRESSIONAL 

CONSIDERATION 



ADMINISTRATION 

ADMINISTRATIVE 

CENTRALIZATION 

CHARACTERISTIC 

CIRCUMSTANTIAL 

CLASSIFICATION 

CORRESPONDENCE 
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HYDROGRAPHIC 

ILLUMINATING 

ILLUMINATION 

ILLUSTRATION 

INAUGURATION 

INCOMPETENCE 

INEFFICIENCY 

INSTRUCTIONS 

INTELLIGENCE 

INTERDICTION 

INTERFERENCE 

INTERMEDIATE 

INTERRUPTION 

INTERVENTION 



INTRODUCTION 

INTRODUCTORY 

IRREGULARITY/ 

LIGHTBOMBERS 

MARKSMANSHIP 

MEASUREMENTS 

MEDIUMBOMBER 

MOBILIZATION 

NONCOMBATANT 

NORTHWESTERN 

OBSTRUCTIONS 

ORGANIZATION 

PREPARATIONS 

PREPAREDNESS 



PRESERVATION 

PRESIDENTIAL 

PROCLAMATION 

PSYCHROMETER 

RADIOSTATION 

RECREATIONAL 

REENLISTMENT 

REGISTRATION 

REPLACEMENTS 

RESPECTFULLY 

ROADJUNCTION 

SATISFACTORY 

SEARCHLIGHTS 

SHARPSHOOTER 



THIRTEEN LETTER WORDS 



CORRESPONDING 

COUNTERATTACK 

DECENTRALIZED 

DEMONSTRATION 

DEPENDABILITY 

DETERMINATION 

DISAPPEARANCE 

DISCREPANCIES 

DISSEMINATION 



DISTINGUISHED 

ENTERTAINMENT 

ESTABLISHMENT 

EXTERMINATION 

EXTRAORDINARY 

FIGHTERPLANES 

IMPRACTICABLE 

INDETERMINATE 

INSTALLATIONS 



INSTANTANEOUS 
INTERNATIONAL 
INVESTIGATION 
MEDIUMB OMBERS 
MISCELLANEOUS 
PRELIMINARIES 
QUALIFICATION 
QUARTERMASTER 
REAPPOINTMENT 



FOURTEEN LETTER WORDS 



DEMOBILIZATION 

DISCONTINUANCE 

DISTINGUISHING 

IDENTIFICATION 

INTERPRETATION 

INVESTIGATIONS 



IRREGULARITIES 

METEOROLOGICAL 

NATURALIZATION 

RECOMMENDATION 

RECONNAISSANCE 

RECONNOITERING 
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SIGNIFICANCE 

SIMULTANEOUS 

SOUTHWESTERN 

SUBSTITUTION 

SUCCESSFULLY 

TRANSFERRING 

TRANSMISSION 

TRANSPACIFIC 

UNIDENTIFIED 

UNITEDSTATES 

UNSUCCESSFUL 

VERIFICATION 

VETERINARIAN 



REENFORCEMENT 

REIMBURSEMENT 

REINFORCEMENT 

REINSTATEMENT 

REVOLUTIONARY 

SPECIFICATION 

TRANSATLANTIC 

WARDEPARTMENT 



RECONSTRUCTION 

REORGANIZATION 

REPRESENTATIVE 

RESPONSIBILITY 

SATISFACTORILY 

TRANSPORTATION 
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B. LIST OF WORDS USED IN MILITARY TEXT ARRANGED IN RHYMING ORDER 

ACCORDING TO WORD LENGTH 



THREE LETTER WORDS 



SEA 


SEE 


MAJ 


TAN 


TOP 


iEAT 


APT 


TAX 


JOB 


AGE 


APJ 


GEN 


GHQ 


■MAT 


BUT 


FIX 


ROB 


SHE 


ASK 


MEN 


BAR 


VAT 


CUT 


MIX 


TUB 


THE 


GAL 


PEN 


CAR 


ACT 


OUT 


SIX 


iQMC 


DIE 


ALL 


TEN 


FAR 


GET 


PUT 


BOX 


ARC 


ONE 


ILL 


PIN 


PAR 


LET 


PVT 


FOX 


BAD 


ARE 


COL 


TIN 


WAR 


NET 


CWT 


DAY 


HAD 


USE 


v CPL 


TON 


HER 


SET 


YOU 


LAY 


ADD 


DUE 


CAM 


WON 


PER 


WET 


CAV 


PAY 


RED 


OWE 


HAM 


DUN 


AIR 


YET 


LAW 


SAY 


AID 


EYE 


AIM 


GUN 


>R 


SGT 


SAW 


WAY 


BID 


OFF 


HIM 


RUN 


J 1 OUR 


WGT 


FEW 


ANY 


DID 


BAG 


ARM 


SUN 


GAS 


FIT 


NEW 


SPY 


RID 


KEG 


SUM 


OWN 


HAS 


GOT 


HOW 


DRY 


OLD 


BIG 


CAN 


AGO 


WAS 


LOT 


LOW 


TRY 


AND 


JIG 


MAN 


TOO 


HIS 


NOT 


NOW 


BUY 


END 


DOG 


NAN 


TWO 


ITS 









FOUR LETTPU WORDS 



AREA 


MIKE 


BASE 


WEEK 


FELL 


JOIN 


PASS 


LIST 


ASIA 


YOKE 


FUSE 


TALK 

BULK 


well 


NOON 


LESS 


LOST 


BULB 


ABLE 


DATE 


HILL 


SOON 


MESS 


POST 


BOMB 


FILE 


LATE 


RANK 


WILL 


DOWN 


LOSS 


JUST 


HEAD 


MILE 


NOTE 


SANK 


FULL 


TOWN 


HITS 


ROUT 


LEAD 


MULE 


BLUE 


TANK 


TOOL 


ZERO 


DAYS 


NEXT 


LOAD 


RULE 


HAVE 


SUNK 


TEAM 


ALSO 


MEAT 


TEXT 


ROAD 


SAME 


FIVE 


BOOK 


.THEM 


INTO 


\ THAT 


LIEU 


RAID 


TIME 


LOVE 


COOK 


ITEM 


KEEP 


WHAT 


DRAW 


SAID 


SOME 


MOVE , 


HOOK 


MAIM 


SHIP 


feet 


XRAY 


HOLD 


LINE 


FUZE 


LOOK 


FROM 


DUMP 


MEET 


AWAY 


HAND 


MINE 


HALF 


TOOK 


FARM 


PUMP 


LEFT 


BODY 


LAND 


NINE 


FLAG 


"DARK 


FIRM 


STOP 


OMIT 


THEY 


KIND 


ZONE 


KING 


PARK 


FORM 


NEAR 


UNIT 


ALLY 


HARD 


JUNE 


LONG " 


mask 


THAN 


REAR 


HALT 


ONLY 


HERD 


OBOE 


EACH 


TASK 


PLAN 


OVER 


TENT 


JULY 


ONCE 


PIPE 


HIGH 


ORAL 


BEEN 


FOUR 


SHOT 


ARMY 


MADE 


TYPE 


DASH 


MTCL 


SEEN 


EYES 


RIOT 


MANY 


AIDE 


TARE 


pusp 


FEEL 


THEN 


THIS 


DIRT v 


VARY 


SIDE 


HERE 


RUSH 


RAIL 


WHEN 


AXIS 


EAST 


VERY 


CODE 


WERE 


WITH 


CALL 


OPEN 


TONS 


FAST 


EASY 


FLEE 


FIRE 


BOTH 


FALL 


MAIN 


GUNS 


LAST 


CITY 


EDGE 

TAKE 


WIRE 

MORE 


LEAK 

BACK 


CELL 


RAIN 


MASS 


WEST 


NAVY 
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FIVE LETTER WORDS 



COMMA 


SCALE 


ALONG 


CANAL 


WAGON 


PRIOR 


DRESS 


START 


ZEBRA 


TITLE 


AMONG 


FATAL 


UNION 


MAJOR 


PRESS 


ALERT 


SQUAD 


ALINE 


BEACH 


VITAL 


COLON 


VALOR 


CROSS 


LEAST 


SPEED 


SLOPE 


REACH 


TOTAL 


DRAWN 


ARMOR 


FLATS 


COAST 


WIPED 


FLARE 


WHICH 


EQUAL 


RADIO 


HONOR 


BOATS 


CREST 


RIGID 


THERE 


MARCH 


USUAL 


EQUIP 


ERROR 


RAFTS 


GUEST 


RAPID 


WHERE 


WEIGH 


NAVAL 


TROOP 


MOTOR 


UNITS 


FIRST 


FIELD 


SHORE 


FRESH 


WHEEL 


GROUP 


AREAS 


TRACT 


BURST 


BLIND 


CEASE 


WIDTH 


STEEL 


CLEAR 


BOMBS 


FLEET 


ABOUT 


GUARD 


ERASE 


FIFTH 


REPEL 


SUGAR 


RAIDS 


QUIET 


ALLOW 


AWARD 


THESE 


TENTH 


LEVEL 


UNDER 


WOODS 


ASSET 


ANNEX 


THIRD 


CLOSE 


NINTH 


APRIL 


ORDER 


YARDS 


SHIFT 


TODAY 


BRIBE 


HORSE 


BOOTH 


SMALL 


DEFER 


MILES 


EIGHT 


DELAY 


PLACE 


CAUSE 


DEPTH 


SHELL 


REFER 


FIRES 


FIGHT 


READY 


VOICE 


HOUSE 


NORTH 


SPELL 


EAGER 


CASES 


LIGHT 


FOGGY 


FORCE 


ROUTE 


SOUTH 


DRILL 


ROGER 


GATES 


NIGHT 


DAILY 


TRUCE 


ISSUE 


SIXTH 


ALARM 


ETHER 


PACKS 


RIGHT 


RALLY 


THREE 


LEAVE 


BREAK 


JAPAN 


OTHER 


DECKS 


SIGHT 


APPLY 


RIDGE 


DRIVE 


BLACK 


QUEEN 


BAKER 


'DOCKS 


AWAIT 


EARLY 


SIEGE 


PROVE 


CHECK 


TAKEN 


LATER 


BANKS 


SPLIT 


ENEMY 


RANGE 


CURVE 


QUICK 


SEVEN 


METER 


TANKS 


LIMIT ' 


EVERY 


BARGE 


SEIZE 


TRUCK 


GIVEN 


PETER 


PLANS 


VISIT 


FERRY 


LARGE 


CHIEF 


CREEK 


ALIGN 


AFTER 


SHIPS 


AGENT 


FIFTY 


GAUGE 


STAFF 


FLANK 


AGAIN 


ENTER 


CORPS 


POINT 


PARTY 


STAKE 


PROOF 


CLERK 


PLAIN 


RIVER 


FEARS 


FRONT 


FORTY 


SMOKE 


BEING 


LOCAL 


TRAIN 


COVER 


PAIRS 


COUNT 


SIXTY 


BROKE 


GOING 


VOCAL 


BEGIN 


THEIR 


HOURS 


DEPOT 


HEAVY 








SIX LETTER WORDS 








CANADA 


HALTED 


DEVICE 


CHARGE 


SEVERE 


ARRIVE 


TRENCH 


MANUAL 


ARABIA 


ROUTED 


NOVICE 


GEORGE 


RETIRE 


ACTIVE 


LAUNCH 


ANNUAL 


ALASKA 


LIQUID 


FIERCE 


REFUGE 


ENTIRE 


TWELVE 


SEARCH 


CASUAL 


PANAMA 


INLAND 


REDUCE 


MORALE 


BEFORE 


BREEZE 


CHURCH 


VISUAL 


METRIC 


ISLAND 


PARADE 


UNABLE 


SECURE 


RELIEF 


SWITCH 


CANCEL 


CRITIC 


DEFEND 


DECIDE 


CIRCLE 


ASSURE 


ZIGZAG 


THOUGH 


VESSEL 


BOMBED 


OFFEND 


DIVIDE 


SINGLE 


FUTURE 


RIDING 


FINISH 


DETAIL 


BARBED 


DEPEND 


DECODE 


MOBILE 


GREASE 


FILING 


EIGHTH 


REFILL 


RAIDED 


EXPEND 


ENCODE 


BEETLE 


CHEESE 


LINING 


FOURTH 


ENROLL 


LANDED 


INTEND 


COFFEE 


BATTLE 


ADVISE 


Mining 


ATTACK 


SCHOOL 


WOODED 


EXTEND 


DECREE 


SETTLE 


DEVISE 


FIRING 


DEBARK 


PATROL 


INDEED 


SEGOND 


DEGREE 


LITTLE 


OPPOSE 


WIRING 


EMBARK 


PISTOL 


ALLIED 


BEYOND 


STRAFE 


NOZZLE 


COURSE 


DURING 


VERBAL 


SYSTEM 


KILLED 


GROUND 


ENGAGE 


MUZZLE 


REFUSE 


NOTING 


RADIAL 


VICTIM 


FORMED 


METHOD 


DAMAGE 


SCHEME 


LOCATE 


MOVING ' 


SERIAL 


SIGCOM 


DOWNED 


PERIOD 


MANAGE 


RESUME 


EXCITE 


FLYING 


ANIMAL 


BOTTOM 


SCORED 


RECORD 


GARAGE 


ENGINE 


MINUTE 


BREACH 


FORMAL 


INFORM 


PASSED 


OFFICE 


BRIDGE 


RAVINE 


RESCUE 


DETACH 


NORMAL 


MEDIUM 


CAUSED 


POLICE 


ALLEGE 


EUROPE 


LEAGUE 


ATTACH 


SIGNAL 


SUDDEN 


UNITED 


ADVICE 


CHANGE 


SPHERE 


PURSUE 


BRANCH 


POSTAL 


SCREEN 
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SUNKEN 


MORTAR 


RUNNER 


FORCES 


CC 


ILORS 


TARGET 


CANNOT? 


MONDAY 


BROKEN 


RUBBER 


KEEPER 


BARGES 


AC 


ICESS 


PICKET 


ACCEPT 


SUNDAY 


SEAMEN 


MEMBER 


HELPER 


BODIES 


E> 


;cess 


ROCKET 


EXCEPT 


ANYWAY 


HAPPEN 


BOMBER 


PROPER 


ALLIES 


to 


[LESS 


BILLET 


PROMPT 


REMEDY 


BATTEN 


NUMBER 


NEARER 


ARMIES 


si 


RESS 


TURRET 


DEPART 


VALLEY 


ELEVEN 


PINCER 


ERASER 


TABLES 


AC 


Iross 


i SUNSET 


DESERT 


PARLEY 


REMAIN 


LEADER 


CENTER 


PLANES 


ASSETS 


! WEIGHT 


DIVERT 


CONVEY 


ATTAIN 


LADDER 


BETTER 


PASSES 


VISITS 


FLIGHT 


ESCORT 


SURVEY 


WITHIN 


MURDER 


LETTER 


LOSSES 


, POINTS 


SLIGHT 


EFFORT 


VERIFY 


COLUMN 


PREFER 


BITTER 


STATES 


STATUS 


‘NAUGHT 


'report 


SUPPLY 


RATION 


SUFFER 


LITTER 


ROUTES 


ALWAYS 


FOUGHT i 


' ARREST 


HOURLY 


ACTION 


MEAGER 


AFFAIR 


ISSUES 


COMBAT 1 


NOUGHT / 
CREDIT / 
SUBMIT/ 


RESIST 


DEPLOY 


COMMON 


HIGHER 


REPAIR 


CRISIS 


DEFEAT , 


ASSIST 


EMPLOY 


SUMMON 


CIPHER 


HARBOR 


SHELLS 


THREAT 


AUGUST 


CONVOY 


POISON 


EITHER 


TERROR 


SPOOLS 


DEFECT | 


COMMIT/ 


ADJUST 


OCCUPY 


LESSON 


TANKER 


MIRROR 


TRAINS 


EFFECT j 


SUMMIT 


DUGOUT 


SALARY 


PONTON 


HAMMER 


SECTOR 


SPOONS 


REJECT^ , 


RESUI/T 


OUTPUT 


ARMORY 


RETURN 


SUMMER 


VICTOR 


STRIPS 


SELECT, . 


ORIENT 


BUREAU 


NINETY 


DRYRUN 


BANNER 


DOCTOR 


TROOPS 


EXPECT 


INTENT 


REVIEW 


EIGHTY 


TATTOO 


MANNER 


CANVAS 


ORDERS 


DIRECT 


EXTENT 


FOLLOW 


TWENTY 


APPEAR 

DOLLAR 


GUNNER 


PLACES 


OTHERS 


( 'STREET 

1 

\ 


INVENT 


FRIDAY 


THIRTY 



MILITIA COVERED 
ANTENNA RETIRED 
ALMANAC ARMORED 
BIVOUAC PRESSED 
TRAFFIC CROSSED 
PACIFIC OMITTED 
ASIATIC DELAYED 
REDUCED COMMAND 
INVADED COMMEND 
DECIDED SUSPEND 
DECODED RESPOND 
ENCODED BOMBARD 
WOUNDED AWKWARD 
GUARDED FORWARD 
PROCEED REPLACE 
ENGAGED SERVICE 
DAMAGED ADVANCE 
REACHED ABSENCE 
MARCHED ENFORCE 
WRECKED BRIGADE 
SHELLED GRENADE 
DROPPED PRECEDE 
STOPPED OUTSIDE 
HUNDRED INCLUDE 
ORDERED EXCLUDE 



SEVEN LETTER WORDS 



REFUGEE 

WINDAGE 

BAGGAGE 

PACKAGE 

VILLAGE 

TONNAGE 

AVERAGE 

STORAGE 

BARRAGE 

PASSAGE 

MESSAGE 

COLLEGE 

ARRANGE 

WITHTHE 

THATTriE 

CHARLIE 

PRAIRIE 

VISIBLE 

BICYCLE 

HOSTILE 

EXTREME 

CONFINE 

MACHINE 

ROUTINE 

Cyclone 



WARFARE 

DECLARE 

PREPARE 

CALIBRE 

MISFIRE 

INSPIRE 

REQUIRE 

INQUIRE 

LECTURE 

RELEASE 

DISEASE 

SUNRISE 

LICENSE 

DEFENSE 

OFFENSE 

PROPOSE . 

SUPPOSE 

PURPOSE 

REVERSE 

BECAUSE^ 

MANDATE*/ 

RADIATE 

OPERATE 

ELEVATE 



Promote 

dOMMUTE 

REVENUE 

RELIEVE 

Receive 

passive 

dA&rivE 

revolve 

approve 

pBSERVE 
■ RESERVE 
/ANALYZE 
/JUMPOFF 
1 BOMBING 
PLACING 
FORCING 
HEADING 
LEADING 
LOADING 
BEDDING 
RAIDING 
HOLDING 
LANDING 
BINDING 
FINDING 



FORGING VARYING 

FISHING ICEBERG 

PUSHING ' DEBOUCH 

NOTHING THROUGH 

TALKING FURNISH 

SINKING TWELFTH 

smoking < seventh 

FALLING SETBACK 

FILLING ' DERRICK' 

KILLING DETRUCK 

RAINING ENTRUCK “ 

MANNING " MEDICAL 
RUNNING LOGICAL 

MORNING CONCEAL 

SLOPING ILLEGAL 

MAPPING MARSHAL 

Searing initial 

GASSING MARTIAL 

MESSING FEDERAL 

MISSING GENERAL 

LIFTING SEVERAL 

Halting central 

GETTING NATURAL 

FITTING COASTAL 

ISSUING GRADUAL 
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UNUSUAL 


ENTRAIN 


ENVELOP 


STARTER 


SUCCESS 


ASSAULT 


RAILWAY 


ARRIVAL 


CONTAIN 


SIMILAR 


QUARTER 


USELESS 


INSTANT 


SECRECY 


CHANNEL 


CAPTAIN 


REGULAR 


DELIVER 


ILLNESS 


ELEMENT 


VACANCY 


COLONEL 


CONDEMN 


CALIBER 


RECOVER 


WITNESS 


COMMENT 


SIGNIFY 


COUNCIL 


ABANDON 


OCTOBER 


AVIATOR 


ADDRESS 


CURRENT 


SATISFY 


FUELOIL 


OPINION 


OFFICER 


TRACTOR 


EXPRESS 


PRESENT 


RAPIDLY 


INSTALL 


SESSION 


POUNDER 


VISITOR 


DISMISS 


APPOINT 


QUICKLY 


DISTILL 


MISSION 


TRIGGER 


TACTICS 


DISCUSS 


RECEIPT 


NIGHTLY 


PAYROLL 


STATION 


WEATHER 


ISLANDS 


TARGETS 


ATTEMPT 


SHORTLY 


CONTROL 


SECTION 


WHETHER 


CHANGES 


SURPLUS 


SUPPORT 


COMPANY 


WILLIAM 


ECHELON 


ANOTHER 


ENEMIES 


RETREAT 


SUGGEST 


DESTROY 


DIAGRAM 


BALLOON 


FARTHER 


BATTLES 


EXTRACT 


HIGHEST 


PRIMARY 


PROGRAM 


PLATOON 


FURTHER 


GLASSES 


CONTACT 


NEAREST 


SUMMARY 


MINIMUM 


LIAISON 


SOLDIER 


CHASSIS 


COLLECT 


PROTEST 


LIBRARY 


‘ MAXIMUM 


HORIZON 


CARRIER 


ATTACKS 


RESPECT 


REQUEST 


JANUARY 


HASBEEN 


EASTERN 


COURIER 


VESSELS 


CORRECT 


AGAINST 1 


BRIBERY 


FIFTEEN 


WESTERN 


HEAVIER 


PATROLS 


PROTECT 


OUTPOST 


BATTERY 


SIXTEEN 


FOGHORN 


TRAWLER 


BOMBERS 


INFLICT 


PROVOST 


INQUIRY 


BETWEEN 


UNKNOWN 


STEAMER 


NUMBERS 


CONDUCT 


BOYCOTT 


CAVALRY 


KITCHEN 


TOBACCO 


CLIPPER 


REPAIRS 


TONIGHT 


WITHOUT 


VICTORY 


WRITTEN 


TORPEDO 


CRUISER 


SAILORS 


CIRCUIT 


LOOKOUT 


EMBASSY 


EXPLAIN 


WARSHIP 


AMMETER 


SECTORS 


RECRUIT 


SIMPLEX 


UTILITY 


TERRAIN 

DETRAIN 


DEVELOP 


FIGHTER 


COMPASS 


PURSUIT 


TUESDAY 


SEVENTY 



EIGHT LETTER WORDS 



INSIGNIA 


EXPELLED 


DICTATED 


STANDARD 


LANGUAGE 


ENVELOPE 


OPPOSITE 


SPECIFIC 


ENROLLED 


EFFECTED 


OUTBOARD 


DISLODGE 


INSECURE 


CONTINUE 


TERRIFIC 


DISARMED 


INFECTED 


OUTGUARD 


EXCHANGE 


PRESSURE 


CRITIQUE 


ECONOMIC 


ASSIGNED 


REJECTED 


WINDWARD 


PROBABLE 


DECREASE 


THATHAVE 


MECHANIC 


RETURNED 


SELECTED 


EASTWARD 


SUITABLE 


EXERCISE 


DECISIVE 


ATLANTIC 


APPEARED 


BILLETED 


WESTWARD 


ELIGIBLE 


SURPRISE 


POSITIVE 


RAILHEAD 


DECLARED 


INVENTED 


DESCRIBE 


TERRIBLE 


SUSPENSE 


PRESERVE 


RAILROAD 


PREPARED 


DEPARTED 


ORDNANCE 


POSSIBLE 


DISPERSE 


EQUALIZE 


REPLACED 


HINDERED 


DESERTED 


DISTANCE 


FLEXIBLE 


TRAVERSE 


MOBILIZE 


ADVANCED 


SUFFERED 


ESCORTED 


COMMENCE 


ASSEMBLE 


DEDICATE 


INVADING 


DEMANDED 


CENTERED 


DEPORTED 


SENTENCE 


OBSTACLE 


INDICATE 


DIVIDING 


EXPANDED 


BATTERED 


REPORTED 


ANNOUNCE 


ENCIRCLE 


INITIATE 


BUILDING 


DEFENDED 


LETTERED 


ARRESTED 


COMMERCE 


SCHEDULE 


ESTIMATE 


GUARDING 


OFFENDED 


REPAIRED 


ENLISTED 


ENFILADE 


MARITIME 


ORDINATE 


ENGAGING 


EXPENDED 


REQUIRED 


SURVIVED 


CONCLUDE 


AIRDROME 


DETONATE 


DAMAGING 


EXTENDED 


RESTORED 


IMPROVED 


LATITUDE 


AIRPLANE 


SEPARATE 


MARCHING 


GROUNDED 


DEFERRED 


OBSERVED 


ALTITUDE 


JETFLANE 


EVACUATE 


BREAKING 


BESIEGED 


CAPTURED 


REVIEWED 


EMPLOYEE 


MEDICINE 


EXCAVATE 


FLANKING 


DETACHED 


REPULSED 


DEPLOYED 


CARRIAGE 


DOCTRINE 


OBSOLETE 


TOTALING 


FINISHED 


COMPOSED 


AIRFIELD 


FUSELAGE 


POSTPONE 


COMPLETE 


SHELLING 


OCCUPIED 


MANDATED 


FOOTHOLD 


EQUIPAGE 


SEABORNE 


CONCRETE 


BATTLING 


ATTACKED 


DEFEATED 


THOUSAND 


FRONTAGE 


AIRBORNE 


EXPEDITE 


SWIMMING 


REPELLED 


REPEATED 


SURROUND 


SABOTAGE 


DEVELOPE 


DEFINITE 


TRAINING 
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PLANNING 


ELEVENTH 


CAMPAIGN 


PRISONER 


VEHICLES 


RESPECTS 


WITHDRAW 


SWEEPING 


ANTITANK 


CHAPLAIN 


IMPROPER 


MISFIRES 


ELEMENTS 


WITHDREW 


SHIPPING 


CODEBOOK 


MAINTAIN 


REPEATER 


DEFENSES 


ATTEMPTS 


TOMORROW 


GROUPING 


CHEMICAL 


MOUNTAIN 


DESERTER 


EXPENSES 


PROTESTS 


PARALLAX 


ENTERING 


CLERICAL 


BULLETIN 


DISASTER 


PURPOSES 


OUTPOSTS 


SATURDAY 


COVERING 


TACTICAL 


INVASION 


REGISTER 


RESERVES 


ENORMOUS 


THURSDAY 


RETIRING" 


CRITICAL 


DECISION 


CANISTER 


ANALYSIS 


LUMINOUS 


CAUSEWAY 


ADVISING 


NAUTICAL 


DIVISION 


RECEIVER 


BARRACKS 


RIGOROUS 


EFFICACY 


OPPOSING 


OFFICIAL 


LOCATION 


REVOLVER 


MISSIONS 


VIGOROUS 


IDENTIFY 


DRESSING 


MATERIAL 


AVIATION 


OBSERVER 


STATIONS 


CONTRACT 


STRATEGY 


PRESSING 


MEMORIAL 


CITATION 


MANEUVER 


FACTIONS 


INDIRECT 


PROBABLY 


CROSSING 


NATIONAL 


TAXATION 


EMPLOYER 


PONTOONS 


CONFLICT 


ASSEMBLY 


DRIFTING 


INTERNAL 


JUNCTION 


HOWITZER 


WARSHIPS 


DISTRICT 


ACTUALLY 


FIGHTING 


CORPORAL 


IGNITION 


CORRIDOR 


OFFICERS 


INSTRUCT 


MONOPOLY 


SIGHTING 


HOSPITAL 


POSITION 


SUPERIOR 


SOLDIERS 


AIRCRAFT 


EASTERLY 


LIMITING 


APPROVAL 


FORENOON 


INTERIOR 


CARRIERS 


DAYLIGHT 


WESTERLY 


PAINTING 


MATERIEL 


SQUADRON 


EXTERIOR 


TRAILER S 


MIDNIGHT 


BOUNDARY 


PRINTING 


PARALLEL 


GARRISON 


OPERATOR 


TRAWLERS 


PROHIBIT 


MILITARY 


SPOTTING 


SENTINEL 


NORTHERN 


DICTATOR 


CRUISERS 


SERGEANT 


SANITARY 


DELAYING 


SEALEVEL 


SOUTHERN 


REJECTOR 


FIGHTERS 


DOMINANT 


FEBRUARY 


RALLYING 


PROTOCOL 


CIRCULAR 


DIRECTOR 


QUARTERS 


ADJUTANT 


CEMETERY 


CARRYING 


MERCIFUL 


DECEMBER 


DETECTOR 


CARELESS 


ADJACENT 


ADVISORY 


FERRYING 


TELEGRAM 


REMEMBER 


ASSOONAS 


WIRELESS 


INCIDENT 


INFANTRY 


APPROACH 


AMERICAN 


NOVEMBER 


POLITICS 


BUSINESS 


ARMAMENT 


CAPACITY 


ENTRENCH 


EUROPEAN 


DEFENDER 


COMMANDS 


DARKNESS 


MOVEMENT 


FATALITY 


INTRENCH 


CIVILIAN 


RECORDER 


ADVANCES 


CONGRESS 


REGIMENT 


CALAMITY 


RESEARCH 


HAVEBEEN 


ENGINEER 


BARRAGES 


PROGRESS 


APPARENT 


VICINITY 


DESPATCH 


NINETEEN 


TRANSFER 


MESSAGES 


FORTRESS 


PASSPORT 


PRIORITY 


DISPATCH 


EIGHTEEN 


DECIPHER 


REMEDIES 


DISTRESS 


INTEREST 


ACTIVITY 


SKIRMISH 


THIRTEEN 


ENCIPHER 


SUPPLIES 


REDCROSS 


REENLIST 


CASUALTY 


DIMINISH 


FOURTEEN 










1 



NINE LETTER WORDS 



MEMORANDA 


CANCELLED 


IMPRESSED 


ATTEMPTED 


ASSURANCE 


AERODROME 


STRATEGIC 


COMPELLED 


DISCUSSED 


PROTESTED 


ALLOWANCE 


HURRICANE 


AUTOMATIC 


DETRAINED 


INDICATED 


REQUESTED 


INCIDENCE 


AEROPLANE 


PATRIOTIC 


ENTRAINED 


POPULATED 


SUBMITTED 


REFERENCE 


INTERVENE 


BALLISTIC 


CONDEMNED 


ESTIMATED 


CONTINUED 


INFLUENCE 


FRONTLINE 


BEACHHEAD 


ECHELONED 


DOMINATED 


DESTROYED 


REENFORCE 


DETERMINE 


SPEARHEAD 


DEVELOPED 


DETONATED 


MOTORIZED 


REINFORCE 


TELEPHONE 


DESCRIBED 


CONQUERED 


SUSPECTED 


SEMIRIGID 


LONGITUDE 


INTERFERE 


ANNOUNCED 


PREFERRED 


CORRECTED 


RECOMMEND 


COMMITTEE 


ELSEWHERE 


BLOCKADED 


CONFERRED 


PROTECTED 


REARGUARD 


ADVANTAGE 


SHELLFIRE 


SUCCEEDED 


DECREASED 


INFLICTED 


NORTHWARD 


CARTRIDGE 


THEREFORE 


PROCEEDED 


INCREASED * 


COMPLETED 


SOUTHWARD 


CHALLENGE 


PROCEDURE 


COMMANDED 


CONDENSED 


INHABITED 


AMBULANCE 


AVAILABLE 


PREMATURE 


SUSPENDED 


COLLAPSED 


EXHIBITED 


DOMINANCE 


UNTENABLE 


DEPARTURE 


BOMBARDED 


DISPERSED 


ASSAULTED 


CLEARANCE 


DIRIGIBLE 


NAVALBASE 


FORTIFIED 


ADDRESSED 


APPOINTED 


ENDURANCE 


PRINCIPLE 


MANGANESE 
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PERSONNEL 

CABLEGRAM 

RADIOGRAM 

FIREALARM 

CRITICISM 

MECHANISM 

DIETITIAN 

SEVENTEEN 

SUSPICION 

BATTALION 

REBELLION 

COLLISION 

PROVISION 

EXPANSION 

ASCENSION 

DIMENSION 

EXTENSION 

EXPLOSION 

ADMISSION 

EXCLUSION 

RADIATION 

VARIATION 

INFLATION 

FORMATION 

OPERATION 

SITUATION 

ELEVATION 

OBJECTION 

DIRECTION 

CONDITION 

COALITION 

PARTITION 

DETENTION 

RETENTION 

INTENTION 

ATTENTION 



INVENTION 

PROMOTION 

SEMIG'OLON 

AFTERNOON 

DISAPPEAR 

IRREGULAR 

SEPTEMBER 

COMMANDER 

SURRENDER 

REMAINDER 

PASSENGER 

MESSENGER 

BRIGADIER 

STRAGGLER 

NEWSPAPER 

CHARACTER 

KJLOMETER 

BAROMETER 

GYROMETER 

DESTROYER 

PROJECTOR 

PROTECTOR 

CHAUFFEUR 

LOGISTICS 

STANDARDS 

RESOURCES 

COMPANIES 

BATTERIES 

EMBASSIES 

AIRDROMES 

SEAPLANES 

AIRPLANES 

EXERCISES 

WITNESSES 

ADDRESSES 

ESTIMATES 



''TEN LETTER WORDS 



REGARDING 

ACCORDING 

INCLUDING 

LAUNCHING 

ATTACKING 

DEBARKING 

REFILLING 

SCREENING 

REMAINING 

OBTAINING 

INCLINING 

BEGINNING 

RETURNING 

PREPARING 

NUMBERING 

CENTERING 

REQUIRING 

OPERATING 

ENLISTING 

RECEIVING 

REVIEWING 

EMPLOYING 

OCCUPYING 

PARAGRAPH 

ESTABLISH 

TWENTIETH 

FIFTEENTH 

SIXTEENTH 

WATERTANK 

TECHNICAL 

CHRONICAL 

PRACTICAL 

POLITICAL 

IDENTICAL 

PRINCIPAL 

DISMISSAL 

CONTINUAL 



CONTINUES ' 

BUILDINGS 

OFFICIALS 

REPRISALS 

PROPOSALS 

CIVILIANS 

CAMPAIGNS 

MAINTAINS 

DIVISIONS 

MUNITIONS 

POSITIONS 

ENGINEERS 

PRISONERS 

READINESS 

CONFLICTS 

DISTRICTS 1 

INCIDENTS 

MOVEMENTS 

OUTSKIRTS 

ANONYMOUS 

APPARATUS 

DISINFECT 

INTERDICT 

DIFFICULT 

COMBATANT 

IMPORTANT 

ASSISTANT 

CONFIDENT 

PRESIDENT 

DEPENDENT 

NEGLIGENT 

DEFICIENT 

EFFICIENT 

PLACEMENT 

AGREEMENT 

AMUSEMENT 



CRITICISE 

INTERPOSE 

ASSOCIATE 

IMMEDIATE 

OSCILLATE 

CIRCULATE 

DESIGNATE 

ALTERNATE 

COOPERATE 

ELABORATE 

PENETRATE 

REINSTATE 

CIGARETTE 

PARACHUTE 

DESTITUTE 

TECHNIQUE 

EXPANSIVE 

DEFENSIVE 

OFFENSIVE 

EXPENSIVE 

INTENSIVE 

EXTENSIVE 

EXPLOSIVE 

EXCESSIVE 

INCLUSIVE 

EXCLUSIVE 

TENTATIVE 

DEFECTIVE 

EFFECTIVE 

OBJECTIVE 

INCENTIVE 

EXECUTIVE 

RECOGNIZE 

SERVICING 

ADVANCING 

PRECEDING 

EXTENDING 



ATOMICBOMB 

GEOGRAPHIC 

GYROSCOPIC 

DIPLOMATIC 

BRIDGEHEAD 

PRESCRIBED 

REENFORCED 

REINFORCED 

BEENNEEDED 

UNEXPENDED 



APPROACHED 

ENTRENCHED 

DESPATCHED 

DISPATCHED 

THREATENED 

MAINTAINED 

DETERMINED 

ONEHUNDRED 

DECIPHERED , 

ENCIPHERED 



COMPRESSED 

DISTRESSED 

DESIGNATED 

RESTRICTED 

INSTRUCTED 

PROHIBITED 

REENLISTED 

MECHANIZED 

CONTRABAND 

UNDERSTAND 



UNDERSTOOD 

COASTGUARD 

POSTOFFiqE 

ACCORDANCE 

ALLEGIANCE 

APPEARANCE 

ACCEPTANCE 

RESISTANCE 

ASSISTANCE 

PRECEDENCE 



STATEMENT 

EQUIPMENT 

GROUPMENT 

INTERMENT 

ALLOTMENT 

PERMANENT 

DIFFERENT 

REPRESENT 

RESTRAINT 

INTERCEPT 

INTERRUPT 

TRANSPORT 

NORTHEAST 

SOUTHEAST 

NORTHWEST 

SOUTHWEST 

INTERVIEW 

YESTERDAY 

WEDNESDAY 

EMERGENCY 

NORTHERLY 

SERIOUSLY 

INSTANTLY 

ACCOMPANY 

ARBITRARY 

NECESSARY 

SECRETARY 

ARTILLERY 

ACCESSORY 

TERRITORY 

LIABILITY 

HOSTILITY 

PROXIMITY 

INDEMNITY 

INTEGRITY 

NECESSITY 



CONFIDENCE 

NEGLIGENCE 

EXPERIENCE 

PREFERENCE 

DIFFERENCE 

CONFERENCE 

CAMOUFLAGE 

DEPENDABLE 

EXPENDABLE 

UNSUITABLE 
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ACCEPTABLE 

IMPASSIBLE 

IMPOSSIBLE 

ASPOSSIBLE 

RECEPTACLE 

MOTORCYCLE 

AUTOMOBILE 

DISCIPLINE 

QUARANTINE 

ENTERPRISE 

TRANSVERSE 

COORDINATE 

ILLUMINATE 

ANTICIPATE 

ILLITERATE 

ILLUSTRATE 

COMPENSATE 

DISTRIBUTE 

SUBSTITUTE 

CONSTITUTE 

COMMUNIQUE 

TWENTYFIVE 

SUCCESSIVE 

IMPRESSIVE 

LOCOMOTIVE 

CENTRALIZE 

NATURALIZE 

DEMOBILIZE 

COMMANDING 

DEBOUCHING 

DETRUCKING 

ENTRUCKING 

ENCIRCLING 

SIGNALLING 

PATROLLING 

OVERCOMING 

DETRAINING 

CONCERNING 

INDICATING 

ANTEDATING 



IMPEDIMENTA 
, TOPOGRAPHIC 
RECOMMENDED 
PREARRANGED 
ESTABLISHED 
OVERWHELMED 
DISAPPEARED 



TEN LETTER WORDS— Continued 



EVACUATING 

COLLECTING 

CONNECTING 

INFLICTING 

EXPEDITING 

RECRUITING 

ATTEMPTING 

SUPPORTING 

EXTINGUISH 

NINETEENTH 

EIGHTEENTH 

THIRTEENTH 

FOURTEENTH 

WILLATTACK 

ARTIFICIAL 

CREDENTIAL 

ADDITIONAL 

ACCIDENTAL 

REGIMENTAL 

INDIVIDUAL 

WITHDRAWAL 

AIRCONTROL 

SUCCESSFUL 

RESPECTFUL 

MEMORANDUM 

SUSPENSION 

DISPERSION 

CONCESSION 

CONFESSION 

DEPRESSION 

IMPRESSION 

POSSESSION 

SUBMISSION 

COMMISSION 

PERMISSION 

DISCUSSION 

CONCLUSION 

DEDICATION 

INDICATION 



ALLOCATION 

FOUNDATION 

RECREATION 

IRRIGATION 

NAVIGATION 

REGULATION 

POPULATION 

ESTIMATION 

DOMINATION 

DETONATION 

OCCUPATION 

SEPARATION 

DECORATION 

LIMITATION 

SANITATION 

INVITATION 

EVACUATION 

EVALUATION 

EXCAVATION 

COLLECTION 

CONNECTION 

INSPECTION 

CORRECTION 

PROTECTION 

EXHIBITION 

EXPEDITION 

DEFINITION 

AMMUNITION 

OPPOSITION 

PROPORTION 

REVOLUTION 

MACHINEGUN 

BATTLESHIP 

CENSORSHIP 

ARMOREDCAR 

DIVEBOMBER 

COMMANDEER 

DISPATCHER 

MILLIMETER 



GONIOMETER 

HYDROMETER 

HYGROMETER 

AMBASSADOR 

INSTRUCTOR 

BALLISTICS 

STATISTICS 

CROSSROADS 

DESPATCHES 

DISPATCHES 

ASSEMBLIES 

FACILITIES 

ACTIVITIES 

CASUALTIES 

FRONTLINES 

SUBMARINES 

OBJECTIVES 

ENEMYTANKS 

SUSPICIONS 

COLLISIONS 

PROVISIONS 

EXPLOSIONS 

FORMATIONS 

OPERATIONS 

DIRECTIONS 

CONDITIONS 

TROOPSHIPS 

NEWSPAPERS 

KILOMETERS 

DESTROYERS 

TRANSPORTS 

SUSPICIOUS 

VICTORIOUS 

CIRCUITOUS 

CONTINUOUS 

PHOSPHORUS 

FLASHLIGHT 

COMMANDANT 

LIEUTENANT 



ELEVEN LETTER WORDS 

SURRENDERED CONSTITUTED INFLAMMABLE 
ENCOUNTERED BATTLEFIELD RESPONSIBLE 
TRANSFERRED PERFORMANCE NAVALBATTLE 
DISINFECTED MAINTENANCE TEMPERATURE 
REAPPOINTED COINCIDENCE^ MANUFACTURE 
INTERCEPTED SUBSISTENCE SCHOOUpJSE 
INTERRUPTED CATASTROPHE CUSTOMHOUSE 



CONTINGENT 

SUFFICIENT 

CONVENIENT 

EQUIVALENT 

ENGAGEMENT 

MANAGEMENT 

EXCITEMENT 

DETACHMENT 

ATTACHMENT 

EXPERIMENT 

ENROLLMENT 

ASSIGNMENT 

ATTAINMENT 

INTERNMENT 

GOVERNMENT 

ASSESSMENT 

COMMITMENT 

DEPARTMENT 

ENLISTMENT 

INSTRUMENT 

DEPLOYMENT 

EMPLOYMENT 

PERSISTENT 

AIRSUPPORT 

CONSPIRACY 

DEFICIENCY 

EFFICIENCY 

COMPLETELY 

APPARENTLY 

INCENDIARY 

COMMISSARY 

ELEMENTARY 

LABORATORY 

TRAJECTORY 

CAPABILITY 

AUDIBILITY 

VISIBILITY 

SIMILARITY 

INSECURITY 



CERTIFICATE 

COMMUNICATE 

INVESTIGATE 

APPROPRIATE 

APPROXIMATE 

EXTERMINATE 

DETERIORATE 
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CONCENTRATE 

DEMONSTRATE 

NECESSITATE 

DISCONTINUE 

SEVENTYPIVE 

PROGRESSIVE 

RETROACTIVE 

DESCRIPTIVE 

SYNCHRONIZE 

APPROACHING 

INTERVENING 

ENGINEERING 

INTERFERING 

ALTERNATING 

INTERESTING 

WITHDRAWING 

DISTINGUISH 

SEVENTEENTH 

NAVALATTACK 

STRATEGICAL 

TRADITIONAL 

CONTINENTAL 

FIRECONTROL 

NATIONALISM 



TRANSPACIFIC 

HYDROGRAPHIC 

UNIDENTIFIED 

COMMISSIONED 

DISSEMINATED 

CONCENTRATED 

DEMONSTRATED 

DISORGANIZED 

SIGNIFICANCE 

INTELLIGENCE 

INTERFERENCE 

INCOMPETENCE 

CONSIDERABLE 

FIGHTERPLANE 

INTERMEDIATE 

DECENTRALIZE 

GENERALSTAFF 

TRANSFERRING 

ENTERPRISING 

ILLUMINATING 

DISTRIBUTING 



ELEVEN LETTER WORDS— Continued 



SMOKESCREEN 

APPLICATION 

ASSOCIATION 

RETALIATION 

DEBARKATION 

EMBARKATION 

LEGISLATION 

CIRCULATION 

INFORMATION 

EXPLANATION 

DESIGNATION 

RESIGNATION 

EXAMINATION 

PREPARATION 

COOPERATION 

IMMIGRATION 

INSPIRATION 

CORPORATION 

PENETRATION 

ARBITRATION 

COMPUTATION 

OBSERVATION 

RESERVATION 

RESTRICTION 



DISTINCTION 

DESTRUCTION 

INSTRUCTION 

RECOGNITION 

REQUISITION 

COMPOSITION 

DISPOSITION 

COMPETITION 

DESCRIPTION 

CONSUMPTION 

INSTITUTION 

LIGHTBOMBER 

HEAVYBOMBER 

RANGEFINDER 

DYNAMOMETER 

THERMOMETER 

INTERPRETER 

RECONNOITER 

BLOCKBUSTER 

AERONAUTICS 

NAVALFORCES 

ACCESSORIES 

HOSTILITIES 

ENEMYPLANES 



PHILIPPINES 

PARENTHESES 

HEAVYLOSSES 

COMMUNIQUES 

PARENTHESIS 

CREDENTIALS 

BATTLESHIPS 

ARMOREDCARS 

CORRECTNESS 

ENGAGEMENTS 

ASSIGNMENTS 

ASSESSMENTS 

INSTRUMENTS 

INTERCERPTS 

ESTIMATEDAT 

SIGNIFICANT 

INDEPENDENT 

INTELLIGENT 

COEFFICIENT 

BOMBARDMENT 

REPLACEMENT 

EMPLACEMENT 

ENFORCEMENT 

ARRANGEMENT 



TWELVE LETTER WORDS 



CONSTITUTING 

BREAKTHROUGH 

GEOGRAPHICAL 

CONFIDENTIAL 

PRESIDENTIAL 

RECREATIONAL 

AGRICULTURAL 

DEPARTMENTAL 

UNSUCCESSFUL 

GENERALALARM 

VETERINARIAN 

TRANSMISSION 

VERIFICATION 

CONFISCATION 

COMMENDATION 

CONCILIATION 

CANCELLATION 

PROCLAMATION 

CONFIRMATION 

CONFORMATION 

COORDINATION 



ILLUMINATION 

ANTICIPATION 

REGISTRATION 

ILLUSTRATION 

INAUGURATION 

COMPENSATION 

CONVERSATION 

RADIOSTATION 

CONTINUATION 

PRESERVATION 

MOBILIZATION 

ORGANIZATION 

INTERDICTION 

ROADJUNCTION 

INTRODUCTION 

CONSTRUCTION 

INTERVENTION 

CONSCRIPTION 

INTERRUPTION 

DISTRIBUTION 

SUBSTITUTION 



CONSTITUTION 

NORTHWESTERN 

SOUTHWESTERN 

MARKSMANSHIP 

MEDIUMBOMBER 

COMMISSIONER 

PSYCHROMETER 

SHARPSHOOTER 

DIFFICULTIES 

UNITEDSTATES 

PREPARATIONS 

OBSTRUCTIONS 

INSTRUCTIONS 

LIGHTBOMBERS 

HEAVYBOMBERS 

HEADQUARTERS 

PREPAREDNESS 

COMPLETENESS 

CARELESSNESS 

SEARCHLIGHTS 

REPLACEMENTS 



CONFINEMENT 

REQUIREMENT 

MEASUREMENT 

IMPROVEMENT 

CONCEALMENT 

ECHELONMENT 

DEVELOPMENT 

APPOINTMENT 

COMPARTMENT 

BELLIGERENT 

INCOMPETENT 

FINGERPRINT 

DISCREPANCY 

PHOTOGRAPHY 

IMMEDIATELY 

EXTENSIVELY 

EFFECTIVELY 

PRELIMINARY 

CONTROVERSY 

ELECTRICITY 

NATIONALITY 

SUITABILITY 

SUPERIORITY 



EMPLACEMENTS 

MEASUREMENTS 

ADVANTAGEOUS 

SIMULTANEOUS 

ANTIAIRCRAFT 

NONCOMBATANT 

CONVALESCENT 

DISPLACEMENT 

COMMENCEMENT 

ANNOUNCEMENT 

ENTANGLEMENT 

DECIPHERMENT 

ENCIPHERMENT 

REENLISTMENT 

INEFFICIENCY 

SUCCESSFULLY 

RESPECTFULLY 

SATISFACTORY 

INTRODUCTORY 

IRREGULARITY 
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TRANSATLANTIC 

DISTINGUISHED 

DECENTRALIZED 

DISAPPEARANCE 

IMPRACTICABLE 

INDETERMINATE 

CORRESPONDING 

CONCENTRATING 

COUNTERATTACK 



CHARACTERISTIC 

RECONNAISSANCE 

DISCONTINUANCE 

CORRESPONDENCE 

ADMINISTRATIVE 

REPRESENTATIVE 

DISTINGUISHING 



THIRTEEN LETTER WORDS 



CHRONOLOGICAL 

CONGRESSIONAL 

INTERNATIONAL 

SPECIFICATION 

QUALIFICATION 

COMMUNICATION 

ACCOMMODATION 

INVESTIGATION 

DISSEMINATION 



DETERMINATION 

EXTERMINATION 

CONSIDERATION 

CONCENTRATION 

DEMONSTRATION 

QUARTERMASTER 

CIRCUMSTANCES 

DISCREPANCIES 

PRELIMINARIES 



FIGHTERPLANES 

INSTALLATIONS 

MEDIUMBOMBERS 

MISCELLANEOUS 

INSTANTANEOUS 

REENFORCEMENT 

REINFORCEMENT 

REIMBURSEMENT 

REINSTATEMENT 



FOURTEEN LETTER WORDS 



RECONNOITERING 

Meteorological 

CIRCUMSTANTIAL 

CLASSIFICATION 

IDENTIFICATION 

RECOMMENDATION 



ADMINISTRATION 

INTERPRETATION 

TRANSPORTATION 

CENTRALIZATION 

NATURALIZATION 

DEMOBILIZATION 



ESTABLISHMENT 

ENTERTAINMENT 

REAPPOINTMENT 

WARDEPARTMENT 

APPROXIMATELY 

EXTRAORDINARY 

REVOLUTIONARY 

DEPENDABILITY 



REORGANIZATION 

RECONSTRUCTION 

IRREGULARITIES 

INVESTIGATIONS 

SATISFACTORILY 

RESPONSIBILITY 
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C. LIST OF WORDS USED IN MILITARY TEXT ARRANGED ALPHABETICALLY 

ACCORDING TO WORD PATTERN 



PATTERN AA 



A 


CC 


EPT 


FA 


LL 






MA 


NN ER 


A 


CC 


ORDING 


FE 


LL 






A 


NN EX 


0 


CC 


UPY 


FU 


LL 






CA 


NN 


OT 


A 


DD 




HI 


LL 






T 


00 




SU 


DD 


EN 


I 


LL 






W 


00 


DS 


LA 


DD 


ER 


INSTA 


LL 






PR 


00 


F 


BE 


DD 


ING 


PAYRO 


LL 






B 


00 


K 


FL 


EE 




REFI 


LL 






C 


00 


K 


S 


EE 




SHE 


LL 






H 


00 


K 


THR 


EE 




SMA 


LL 


i 




L 


00 


K 


PROC 


EE 


D 


SPE 


LL 






T 


00 


K 


SP 


EE 


D 


WE 


LL 






SCH 


00 


L 


CR 


EE 


K 


WI 


LL 






T 


00 


L 


W 


EE 


K 


VI 


LL 


AGE 




PLAT 


00 


N 


F 


EE 


L 


CO 


LL 


APSED 




S 


00 


N 


ST 


EE 


L 


DO 


LL 


AR 




TR 


00 


PS 


WH 


EE 


L 


OSCI 


LL 


ATE 




C 


00 RDINATE 


B 


EE 


N 


KI 


LL 


ED 




B 


00 TH 


FOURT 


EE 


N 


BI 


LL 


ET 




STO 


PP ED 


HASB 


EE 


N 


BU 


LL 


ETIN 




HA 


PP EN 


QU 


EE 


N 


VA 


LL 


EY 




CLI 


PP ER 


SCR 


EE 


N 


A 


LL 


IED 




MA 


PP 


ING 


S 


EE 


N 


A 


LL 


IES 




A 


PP 


LY 


SIXT 


EE 


N 


FA 


LL 


ING 


- 


SU 


PP 


LY 


R 


EE 


NLIST 


PATRO 


LL 


ING 




A 


PP 


OINT 


K 


EE 


P 


SHE 


LL 


ING 




A 


PP 


OINTED 


Sff 


EE 


PING 


A 


LL 


OW 




SU 


PP 


ORT 


F 


EE 


T 


A 


LL 


Y 




SU 


PP 


ORTING 


FL 


EE 


T 


RA 


LL 


Y 




A 


PP ROVE 


M 


EE 


T 


•CO 


MM 


A ( 




TE 


RR 


AIN 


JUMPO 


FF 




CO 


MM 


AND 

ANDER 




CU 


RR 


ENT 


0 


FF 




CO 


MM 




A 


RR,EST 


STA 


FF 




SU 


MM 


ARif 




HU 


RR 


ICANE 


0 


FF 


END 


CO 


MM 


END 




DE 


RR 


ICK 


SU 


FF 


ER 


CO 


MM 


ENiy 




GA 


RR 


ISON 


TRA 


FF 


IC 


HA 


MM 


ER 




A 


RR 


IVE 


0 


FF 


ICE 


SU 


MM 


ER 




CA 


RR 


Y 


0 


FF 


ICER 


CO 


MM 


IT 




FE 


RR 


Y 


E 


FF 


ORT 


SU 


MM 


IT 




ACRO 


SS 




FO 


GG 


Y 


SU 


MM 


ON 




COMPA 


SS 




A 


LL 




CO 


MM 


UTB/ 

AGS 




CONGRE 


SS 




CA 


LL 




TO 


NN 




CRO 


SS 




CE 


LL 




CHA 


NN 


EL 




DARKNE 


SS 




DRI 


LL 




BA 


NN 


ER 




DRE 


SS 




ENRO 


LL 




GU 


NN 


ER 




LE 


SS 
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RESTRICTED , 

PATTERN AA— Continued 



LO 


ss 




A 


ss 


IGNED 


BA 


TT 


EN 


MA 


ss 




CRO 


SS 


ING 


WRI 


TT 


EN 


ME 


ss 




DRE 


SS 


ING 


BI 


TT 


ER 


PA 


ss 




ME 


SS 


ING 


LI 


TT 


ER 


PRE 


ss 




PA 


SS 


IVE 


BA 


TT 


ERY 


UNLE 


ss 




LE 


SS 


ON 


SPO 


TT 


ING 


WITNE 


ss 




I 


SS 


UE 


BA 


TT 


LE 


PA 


ss 


ED 


A 


SS 


URE 


BA 


TT 


LESHIP 


A 


ss 


EMBLY 


EMBA 


SS 


Y 


MU 


ZZ 


LE 


A 


ss 


ET 


OMI 


TT 


ED 


NO 


ZZ 


LE 


PO 


ss 


IBLE 


SUBMI 


TT 


ED 









MISCELLANEOUS PATTERNS 



AABA 


AGR EEME NT 


AABA 


K EEPE R 


AABA 


CH EESE 


AABA 


BR EEZE 


AABA 


MA NNIN G 


AABA 


PLA NNIN G 


AABA 


RU NNIN G 


AABA 


L OOKO UT 


AABA 


E RROR 


AABA 


MI RROR 


AABA 


TE RROR 


AABA 


GLA SSES 


AABA 


LO SSES 


AABA 


PA SSES 


AABA 


CHA SSIS 


AABA 


A SSIS T 


AABAACB 


A SSESSME NT 


AABAACBDEA 


A SSFSSMENTS 


AABAB 


PROC EEDED 


AABB 


CO FFEE 


AABB 


BA LLOO N 


AABBAACAC 


B EENNEEDED 


AABBCBC 


SU CCEEDED 


AABCA 


B EETLE 


AABCA 


A NNOUN CE 


AABCA 


F OOTHO LD 


AABCA 


CA RRIER 


AABCA 


A SSETS 


AABCA 


I SSUES 


AABCADEC 


CO MMITMENT 


AABCADEC 


A TTENTION 


AABCADEFEA 


A NNOUNCEMEN 


AABCB 


SQ£ EENIN G 


AABCB 


SJ FFERE D 


AABCB 


DI FFERE NT 


AABCB 


0 FFICI AL 



AABCB 


SU FFICI ENT 


AABCB 


A LLEGE 


AABCB 


CO LLEGE 


AABCB 


BI LLETE D 


AABCB 


A MMETE R 


AABCB 


W OODED 


AABCB 


TE RRIFI C 


AABCB 


BA TTERE D 


AABCBDEB 


DI FFERENCE 


AABCC 


A CCESS 


AABCC 


A CCESS ORY 


AABCC 


CO MMISS ARY 


AABCCB 


WI LLATTA ck 


AABCCDD 


CO MMITTEE 


^ABCCDEFBC 


A CCESSORIES 


AABCDA 


I LLEGAL 


/lABCDA 


A TTEMPT 


AABCDAB 


A TTEMPTE D 


aabcdb 


0 FFENSE 


AABCDB 


CHA LLENGE 


aabcdb 


BA LLISTI C 


AABCDB 


A RRESTE D 


AABCDB 


PA SSENGE R 


AABCDB 


BA TTERIE S 


AABCDBA 


SU RRENDER 


aabcdbabd 


SU RRENDERED 


AABCDBC 


CO MMANDAN T 


AABCDBD 


0 FFENDED 


AABCDBEC 


BA LLISTI CS 


aIabcdc 


E FFICAC Y 


AABCDD 


A DDRESS 


AABCDD 


I LLNESS 


A'ABCDDCA 


A DDRESSED 


a'abcddcd 


A DDRESSES 


a!abcdeb 

| 


CO MMUNIQU E 


AABCDEB 


TR OOPSHIP 
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MISCELLANEOUS PATTERNS— Continued 


A 


AABCDEB 


A SSEMBLE 


ABA 


INVA DED 


W 


AABCDEBC 


TR OOPSHIPS 


ABA 


LAN DED 




AABCDEC 


CO MMANDIN G 


ABA 


RAI DED 




AABCDEC0 


BA TTLEFIEL D 


ABA 


WOUN DED 




AABCDED 


CO MMANDED 


ABA 


DID 




AABCDEDFd 


A MMUNITION 


ABA 


IC EBE RG 




AABCDEE 


CO MMANDEE R 


ABA 


PR ECE DING 




AABCDEFA 


R EENLISTE D 


ABA 


R ECE IPT 




AABCDEFA 


I RREGULAR 


ABA 


CR EDE NTIAL 




AABCDEFB 


0 FFENSIVE 


ABA 


F EDE RAL 




AABCDEFBA 


A SSEMBLIES 


ABA 


D EFE AT 




AABCDEFC 


A LLOTMENT 


ABA 


D EFE CT 




AABCDEFC 


C OOPERATE 


ABA 


D EFE R 




AABCDEFD 


I LLUSTRAT E 


ABA 


SI EGE 




AABCDEFD 


A SSIGNMEN T 


ABA 


R EJE CT 




AABCDEFOGA 


A SSIGNMENTS 


ABA 


S ELE CT 




AABCDEFGA 


C OOPERATIO N 


ABA 


T ELE GRAM 




AABCDEFGABF 


R EENLISTMENT 


ABA 


ELE VATION 




AABCDEFGD 


BA TTLESHIPS 


ABA 


SCH EME 




AABCDEFGDAE 


C OORDINATION 


ABA 


R EME DY 




AABCDEFGDE 


A PPOINTMENT 


ABA 


DISPLAC EME NT 




ABA 


AGA IN 


ABA 


PLAC EME NT 




ABA 


AGA INST 


ABA 


ENE MY 




ABA 


C ALA MITY 


ABA 


G ENE RAL 




ABA 


ALA RM 


ABA 


R EPE L 




ABA 


S ALA RY 


ABA 


H ERE 


• 


ABA 


D AMA GE 


ABA 


SPH ERE 




ABA 


M ANA GE 


ABA 


TH ERE 




ABA 


C ANA L 


ABA 


W ERE 




ABA 


ANA LYZE 


ABA 


WH ERE 




ABA 


J APA N 


ABA 


CONQU ERE D 




ABA 


P ARA CHUTE 


ABA 


COV ERE D ' 




ABA 


P ARA DE 


ABA 


TH ESE 




ABA 


SEP ARA TION 


ABA 


PR ESE NT 




.ABA 


F ATA L 


ABA 


D ESE RT 




ABA 


N AVA L 


ABA 


COMPL ETE 




ABA 


N AVA LFORCES 


ABA 


KILOM ETE R 




ABA 


C AVA LRY 


ABA 


M ETE R 




ABA 


EXC AVA TION 


ABA 


P ETE R 




ABA 


AWA IT 


ABA 


D EVE LOP 




ABA 


AWA RD 


ABA 


S EVE N 




ABA 


AWA Y 


ABA 


S EVE NTH 




ABA 


PRO BAB LE 


ABA 


S EVE NTY 




ABA 


PRO BAB LY 


ABA 


S EVE RAL 




ABA 


BI CYC LE 


ABA 


EVE RY 




ABA 


CYC LONE 


ABA 


EYE 




ABA 


BLOCKA DED 


ABA 


FIF TH 




ABA 


GROUN DED 


ABA 


FIF TY 




ABA 


GUAR DED 


ABA 


EIG HTH 
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LO SS 


r 


PATTERN 

A 


AA — Continued 
SS IGNED 


BA TT EN 


MA SS 




CRO 


SS ING 


WRI TT EN 


ME SS 




DRE 


SS ING 


BI TT ER 


PA SS 




ME 


SS ING 


LI TT ER 


PRE SS 




PA 


SS IVE 


BA TT ERY 


UNLE SS 




LE 


SS ON 


SPO TT ING 


WITNE SS 




I 


SS UE 


BA TT LE 


PA SS ED 




A 


SS URE 


BA TT LESHIP 


A SS EMBLY 




EMBA 


SS Y 


MU ZZ LE 


A SS ET 




OMI 


TT ED 


NO ZZ LE 


PO SS IBLE 
AABA 


AGR 


SUBMI TT ED 

MISCELLANEOUS PATTERNS 
EEME NT AABCB 


SU FFICI ENT 


AABA 


K 


EEPE R 


AABCB 


A LLEGE 


AABA 


CH 


EESE 


AABCB 


CO LLEGE 


AABA 


BR 


EEZE 


AABCB 


BI LLETE D 


AABA 


MA 


NNIN G 


AABCB 


A MMETE R 


AABA 


PLA 


NNIN G 


AABCB 


W OODED 


AABA 


RU 


NNIN G 


AABCB 


TE RRIFI C 


AABA 


L 


OOKO UT 


AABCB 


BA TTERE D 


AABA 


E 


RROR 


AABCBDEB 


DI FFERENCE 


AABA 


MI 


RROR 


AABCC 


A CCESS 


AABA 


TE 


RROR 


AABCC 


A CCESS ORY 


AABA 


GLA 


SSES 


AABCC 


CO MMISS ARY 


AABA 


LO 


SSES 


AABCCB 


WI LLATTA CK 


AABA 


PA 


SSES 


AABCCDD 

AABCCDEFBC 

AABCDA 


CO MMITTEE 


AABA 


CHA 


SSIS 


A CCESSORIES 


AABA 


A 


SSIS T 


I LLEGAL 


AABAACB 


A 


SSESSME NT 


AABCDA 


A TTEMPT 


AABAACBDEA 


A 


SSFSSMENTS 


AABCDAB 


A TTEMPTE D 


AABAB 


PROC 


EEDED 


AABCDB 


0 FFENSE 


AABB 


CO 


FFEE 


AABCDB 


CHA LLENGE 


AABB 


BA 


LLOO N 


AABCDB 


BA LLISTI C 


AABBAACAC 


B 


EENNEEDED 


AABCDB 


A RRESTE D 


AABBGBC 


SU 


CCEEDED 


AABCDB 


PA SSENGE R 


AABCA 


B 


EETLE 


AABCDB 


BA TTERIE S 


AABCA 


A 


NNOUN CE 


AABCDBA 


SU RRENDER 


AABCA 


F 


OOTHO LD 


AABCDBABD 


SU RRENDERED 


AABCA 


CA 


RRIER 


AABCDBC 


CO MMANDAN T 


AABCA 


A 


SSETS 


AABCDBD 


0 FFENDED 


AABCA 


I 


SSUES 


AABCDBEC 

aIabcdc 


BA LLISTICS 


AABCADEC 


S9 


MMITMENT 


E FFICAC Y 


AABCADEC 


A 


TTENTION 


AABCDD 


A DDRESS 


AABCADEFEA 


A 


NNOUNCEMEN T 


AABCDD 


I LLNESS 


AABCB 


SQf. 


BENIN G 


a'abcddca 


A DDRESSED 


AABCB 


SJ 


FFERE D 


a'abcddcd 


A DDRESSES 


AABCB 


DI 


FFERE NT 


A,ABCDEB 


CO MMUNIQU E 


AABCB 


0 


FFICI AL 


AABCDEB 


TR OOPSHIP 
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AABCDEB 


A 


SSEMBLE 


ABA 


INVA 


DED 




AABCDEBC 


TR 


OOPSHIPS 


ABA 


LAN 


DED 




AABCDEC 


CO 


MMANDIN G 


ABA 


RAI 


DED 




AABCDECf 


BA 


TTLEFIEL D 


ABA 


WOUN 


DED 




AABCDED 


CO 


MMANDED 


ABA 




DID 




AABCDEDFd 


A 


MMUNITION 


ABA 


IC 


EBE 


RG 


AABCDEE 


CO 


MMANDEE R 


ABA 


PR 


ECE 


DING 


AABCDEFA 


R 


EENLISTE D 


ABA 


R 


ECE 


IPT 


AABCDEFA 


I 


RREGULAR 


ABA 


CR 


EDE 


NTIAL 


AABCDEFB 


0 


FFENSIVE 


ABA 


F 


EDE 


RAL 


AABCDEFBA 


A 


SSEMBLIES 


ABA 


D 


EFE 


AT 


AABCDEFC 


A 


LLOTMENT 


ABA 


D 


EFE 


CT 


AABCDEFC 


C 


OOPERATE 


ABA 


D 


EFE 


R 


AABCDEFD 


I 


LLUSTRAT E 


ABA 


SI 


EGE 




AABCDEFD 


A 


SSIGNMEN T 


ABA 


R 


EJE 


CT 


AABCDEFDGA 


A 


SSIGNMENTS 


ABA 


S 


ELE 


CT 


AABCDEFGA 


C 


OOPERATIO N 


ABA 


T 


ELE 


GRAM 


AABCDEFGABF 


R 


EENLISTMENT 


ABA 




ELE 


VATION 


AABCDEFGD 


BA 


TTLESHIPS 


ABA 


SCH 


EME 




AABCDEFGDAE 


C 


OORDINATION 


ABA 


R 


EME 


DY 


AABCDEFGDE 


A 


PPOINTMENT 


ABA 


DISPLAC 


EME 


NT 


ABA 




AGA IN 


ABA 


PLAC 


EME 


NT 


ABA 




AGA INST 


ABA 




ENE 


MY 


ABA 


C 


ALA MITY 


ABA 


G 


ENE 


RAL 


ABA 




ALA RM 


ABA 


R 


EPE 


L 


ABA 


S 


ALA RY 


ABA 


H 


ERE 




ABA 


D 


AMA GE 


ABA 


SPH 


ERE 




ABA 


M 


ANA GE 


ABA 


TH 


ERE 




ABA 


C 


ANA L 


ABA 


W 


ERE 




ABA 




ANA LYZE 


ABA 


WH 


ERE 




ABA 


J 


APA N 


ABA 


CONQU 


ERE 


D 


ABA 


P 


ARA CHUTE 


ABA 


COV 


ERE 


D ' 


ABA 


P 


ARA DE 


ABA 


TH 


ESE 




ABA 


SEP 


ARA TION 


ABA 


PR 


ESE 


NT 


.ABA 


F 


ATA L 


ABA 


D 


ESE 


RT 


ABA 


N 


AVA L 


ABA 


COMPL 


ETE 




ABA 


N 


AVA LFORCES 


ABA 


KILOM 


ETE 


R 


ABA 


C 


AVA LRY 


ABA 


M 


ETE 


R 


ABA 


EXC 


AVA TION 


ABA 


P 


ETE 


R 


ABA 




AWA IT 


ABA 


D 


EVE 


LOP 


ABA 




AWA RD 


ABA 


S 


EVE 


N 


ABA 




AWA Y 


ABA 


S 


EVE 


NTH 


ABA 


PRO 


BAB LE 


ABA 


S 


EVE 


NTY 


ABA 


PRO 


BAB LY 


ABA 


S 


EVE 


RAL 


ABA 


BI 


CYC LE 


ABA 




EVE 


RY 


ABA 




CYC LONE 


ABA 




EYE 




ABA 


BLOCKA 


DED 


ABA 




FIF 


TH 


ABA 


GROUN 


DED 


ABA 




FIF 


TY 


ABA 


GUAR 


DED 


ABA 


EIG 


HTH 
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ABA 


L 


IAI 


SON 


ABA 


CA 


RTR IDGE 


ABA 


PROH 


IBI 


T 


ABA 


D 


RYR UN 


ABA 


SERV, 


1CI 


NG 


ABA 


DI 


SAS TER 


ABA 


RA 


IDI 


NG 


ABA 


CA 


SES 


ABA 


R 


IDI 


NG 


ABA 


RE 


SIS T 


ABA 


R 


IGI 


D 


ABA 




SUS PEND 


ABA 


F 


ILI 


NG 


ABA 




SYS TEM 


ABA 


M 


ILI 


TARY 


ABA 


S 


TAT ION 


ABA 


MOB 


ILI 


ZE 


ABA 


DIC 


TAT OR 


ABA 


S 


IMI 


LAR 


ABA 




TIT LE 


ABA 


L 


IMI 


T 


ABA 


AL 


TIT UDE 


ABA 


PROX 


IMI 


TY 


ABA 


LA 


TIT UDE 


ABA 


F 


INI 


SH 


ABA 




TOT AL 


ABA 


F 


IRI 


NG 


ABA 




TOT ALING 


ABA 


RET 


IRI 


NG 


ABA 


A 


UGU ST 


ABA 


W 


IRI 


NG 


ABA 




USU AL 


ABA 


V 


ISI 


BLE 


ABA _ 


F 


UTU RE 


ABA 


D 


ISI 


NFECT 


ABA 


SUR 


VIV ED 


ABA 


ADV 


ISI 


NG 


ABAA 


HAV 


EBEE N 


ABA 


DEG 


ISI 


ON 


ABAA 




SESS ION 


ABA 


V 


ISI 


T 


ABAACC 




TATTOO 


ABA 


V 


ISI 


TOR 


ABAB 


DETRA 


ININ G 


ABA 


POL 


ITI 


CS 


ABAB 


L 


ININ G 


ABA 


CR 


ITI 


QUE 


'ABAB 


M 


ININ G 


ABA 


POS 


ITI 


VE 


'ABAB 


OBTA 


ININ G 


ABA 




MEM 


ORIAL 


ABAB 


RA 


ININ G 


ABA 




NAN 




ABAB 


REMA 


ININ G 


ABA 


DOMI 


NAN 


CE 


ABAB 


TRA 


ININ G 


ABA 


ORD 


NAN 


CE 


ABAB, 


CR 


ISIS 


ABA 


DOMI 


NAN 


T 


ABAB 


WI 


THTH E 


ABA 




NIN 


E 


ABAB 


PAR 


TITI ON 


ABA 




NIN 


ETY 


ABACA 


C 


ANADA 


ABA 


MOR 


NIN 


G 


ABACA 


P 


ANAMA 


ABA 




NIN 


TH 


ABACA 


PR 


ECEDE 


ABA 




OBO 


E 


ABACA 




ELEME NT 


ABA 


C 


OLO 


N 


ABACA 




ELEME NTARY 


ABA 


SEMIC 


OLO 


N 


ABACA 




ELEVE N 


ABA 


C 


OLO 


RS 


ABACA 


C 


EMETE RY 


ABA 


AUT 


OMO 


BILE 


ABACA 


S 


EVERE 


ABA 


PR 


OMO 


TE 


ABACA 


AUD 


IBILI TY 


ABA 


H 


ONO 


R 


ABACA 


EXH 


IBITI ON 


ABA 


VIG 


ORO 


US 


ABACA 


V 


ICINI TY 


ABA 


M 


OTO 


R 


ABACA 


M 


ILITI A 


ABA 


M 


OTO 


RIZED 


ABACA 


FAC 


ILITI ES 


ABA 


PR 


OVO 


ST 


ABACA 


D 


IMINI SH 


ABA 




PIP 


E 


ABACA 


L 


IMITI NG 


ABA 




POP 


ULATED 


ABACA 




INITI AL 


ABA 


LIB 


RAR 


Y 


ABACA 


DEF 


INITI ON 


ABA 


AI 


RDR 


OME 


ABACA 


D 


IRIGI BLE 
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MISCELLANEOUS 


ABACA 


SEM IRIGI D 


ABACA 


REQU ISITI ON 


ABACA 


C IVILI AN 


ABACA 


D IVISI ON 


ABACA 


L OCOMO TIVE 


ABACA 


M ONOPO LY 


ABACA 


PR OTOCO L 


ABACA 


CONS TITUT E 


ABACA 


UNUSU AL 


ABACADA 


V ISIBILI TY 


ABACADB 


DEF INITION 


ABACADBA 


PR ECEDENCE 


ABACADC 


INITIAT E 


ABACADD 


COMPL ETENESS 


ABACADDA 


N AVALATTA CK 


ABACADEC 


D IVISIONS 


ABACB 


V ACANC Y 


ABACB 


COMB ATANT 


ABACB 


C ATAST ROPHE 


ABACB 


D ETECT OR 


ABACB 


V ISITS ' 


ABACB 


MEMBE R 


ABACBDEC 


D ETENTION 


ABACBDEC 


R ETENTION 


ABACBDEFGFAG 


NONCOMBATANT 


ABACC 


R EBELL ION 


ABACC 


N ECESS ARY 


ABACC 


N ECESS ITY 


ABACC 


CAR ELESS 


ABACC 


WIR ELESS 


ABACCA 


P ARALLA X 


ABACCA 


R EPELLE D 


ABACCA 


T OMORRO W 


ABACCDACC 


CAR ELESSNESS 


ABACCDC 


P ARALLEL 


ABACCDEFEA 


N ECESSITATE 


ABACDA 


' ALASKA 


ABACDA 


ARABIA i 


ABACDA 


N AVALBA £E 


ABACDA 


R ECEIVE „ 


ABACDA 


D ECEMBE R 


ABACDA 


D EFENSE 


ABACDA 


R EJECTE D 


ABACDA 


R ELEASE 1 


ABACDA 


S ELECTE t) 


ABACDA 


R EMEDIE S 


ABACDA 


EMERGE NCY 


ABACDA 


ENEMIE S 


ABACDA 


R EPEATE D 



PATTERNS— Continued 



ABACDA 


R 


EVENUE 


ABACDA 


U 


NKNOWN 


ABACDA'; 1 


PR 


OMOTIO N 


ABACDAAC 


S 


EVENTEEN 


ABACDAACD 


S 


EVENTEENT H 


ABACDAC 


D 


ESERTER 


ABACDAD 


D 


EFENSES 


ABACDAED 




AVAILABL E 


ABACDAEEC 


N 


AVALBATTL E 


ABACDB 


F 


ATALIT Y 


ABACDB 


A 


NONYMO US 


ABACDB 


C 


OLONEL 


ABACDBA 


TH 


EREFORE 


ABACDC 


R 


ECEIVI NG 


ABACDC 


DYNA 


MOMETE R 


ABACDCA 


L 


IMITATI ON 


ABACDCCA 




NINETEEN 


ABACDCCAD 




NINETEENT H 


ABACDCEA 


S 


TATEMENT 


ABACDCECFGHIE 


M 


ETEOROLOGICAL 


ABACDD 




FIFTEE N 


ABACDD 


FO 


RTRESS 


ABACDDEC 




FIFTEENT H 


ABACDEA 




ELEVATE 


ABACDEA 


D 


EVELOPE 


ABACDEA 


VER 


IFICATI ON 


ABACDEA 


S 


IMILARI TY 


ABACDEAD 




SUSPENSE 


ABACDEAFGE 




SUSPENSION 


ABACDEB 


EXPL 


ANATION 


ABACDEB 


T 


OPOGRAP HIC 


ABACDEBFA 


R 


ECEPTACLE 


ABACDEC 




ABANDON 


ABACDEC 


D 


AMAGING 


ABACDEC 


QU 


ARANTIN E 


ABACDECA 


P 


ENETRATE 


ABACDECFBA 


D 


ETERIORATE 


ABACDECFGB 


P 


ENETRATION 


ABACDED 


C 


APABILI TY 


ABACDED 


M 


OTORCYC LE 


ABACDED 




SUSPICI ON 


ABACDEDEDC 


G 


ENERALALAR M 


ABACDED^BA 




SUSPICIOUS 


ABACDEDFGA 




SUSPICIONS 


ABACDEFA 


D 


EFECTIVE 


ABACDEFA 


D 


EFENSIVE 


ABACDEFA 


T 


ELEPHONE 


ABACDEFA 


D 


ETERMINE 


ABACDEFA 


D 


EVELOPME NT 
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AEjAlCDEFA 


" EXERCISE ' 


ABBA 


SH IPPI NG 


ABACDEFAF* 


" EXERCISES 


ABBA 


M ISSI NG 


ABACDEFB 


DEDICATE 


ABBA 


ADM ISSI ON " 


ABACDEFB 


ENEMYTAN KS 


ABBA 


M ISSI ON 


ABACDEFC 


DEDICATI ON 


ABBA 


PERM ISSI ON 


ABACDEFCDFE 


V ETERINARIAN 


ABBA 


F ITTI NG 


ABACDEFCFD 


ELECTRICIT Y 


ABBA 


AFTER NOON 


ABACDEFD 


SUSPECTE D 


ABBA 


NOON 


ABACDEFDF 


SUSPENDED 


ABBA 


F OLLO If 


ABACDEFE 


ANALYSIS 


ABBA 


C OMMO N 


ABACDEFGA 


EXECUTIVE 


ABBA 


OPPO SE 


ABACDEFGB 


POPULATIO N 


ABBA 


OPPO SITE " 


ABACDEFGBA 


ENEMYPLANE S 


ABBA • 


B OTTO M 


ABACDEFGBA 


S EVENTYFIVE 


ABBAB 


B AGGAG E 


ABACDEFGBEHF 


D ETERMINATION 


ABBAB 


WITN ESSES 


ABACDEFGDHH 


G ENERALSTAFF 


ABBACA 


APPARA TUS 


ABACDEFGE 


MEMORANDA 


abbacA 


L ETTERE D 


ABACDEFGHA 


MEMORANDUM 


ABBACB 


V ESSELS 


ABACDEFGHIA 


D ECENTRALIZE 


ABBACDA 


M ESSENCE R 


ABBA 


AFFA IR 


ABBACDA 


EFFECTE D 


ABBA 


APPA RENT 


ABBACDB 


M ISSIONS 


ABBA 


APPA RENTLY 


abbacdeA 


IRRIGATI ON 


ABBA 


B ARRA CKS 


ABBACDEDA 


OPPOSITIO N 


ABBA 


B ARRA GE 


abbacdefA 


EFFECTIVE 


ABBA 


ARRA NGE 


ABBACDEFA 


D IFFICULTI ES 


ABBA 


P ASSA GE 


ABBACDEFA 


IMMIGRATI ON 


ABBA 


ASSA ULT 


ABBACDEFCD 


ILLITERATE 


ABBA 


ATTA CH 


ABBACDEFDfl 


ATTAINMENT 


ABBA 


ATTA CK 


ABBACDEFEd 


ARRANGEMEN T 


ABBA 


ATTA IN 


ABBACDEF&& 


ATTACHMENT 


ABBA 


B ATTA LION 


ABBCA 


ANNUA L 


ABBA 


IN DEED 


ABBCA 


APPEA R 


ABBA 


EFFE CT 


ABBCA 


DIS APPEA R 


ABBA 


COMP ELLE D 


ABBCA 


C ARRIA GE 


ABBA 


SH ELLE D 


ABBCA 


S ETTLE 


ABBA 


CONF ERRE D 


ABBCA 


ISSUI NG 


ABBA 


COMPR ESSE D 


ABBCA 


FOUR TEENT H 


ABBA 


IMPR ESSE D 


ABBCA 


SIX TEENT H 


ABBA 


PR ESSE D 


ABBCA 


CHA UFFEU R 


ABBA 


V ESSE L 


ABBCA 


S URROU ND 


ABBA 


CIGAR ETTE 


ABBCADAEF<? 


APPEARANCE 


ABBA 


B ETTE R 


abbcAdAefc 


DIS' APPEARANCE 


ABBA 


L ETTE R 


ABBCADC 


APPEARE D 


ABBA 


D IFFI CULT 


ABBCBBDA 


P OSSESSIO N 


ABBA 


W ILLI AM 


abBcbda 


ASSISTA NCE 


ABBA 


F ILLI NG 


ABBCBDAED 


ASSISTANT 


ABBA 


K ILLI NG 


ABBCCDAB 


ASSOONAS 


ABBA 


REF ILLI NG 


ABBCDA 


ALLOWA NCE 


ABBA 


SW IMMI NG 


ABBCDA 


APPROA CH 
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ABBCDA 


ARRIVA L 


ABCA 




ADVA 


NCE 


ABBCDA 


ASSURA NCE 


ABCA 


DI 


AGRA 


M 


ABBCDA 


M ESSAGE 


ABCA 


EV 


ALUA 


TION 


v ABBCDA 


ILLUMI NATE 


ABCA 




ALWA 


YS 


ABBCDAB 


M ESSAGES 


ABCA 


C 


AMPA 


IGN 


ABBCDAB 


C ORRIDOR 


ABCA 


M 


ANDA 


TE 


ABBCDAEA 


B ELLIGERE NT 


ABCA 


M 


ANUA 


L 


ABBCDAEFC 


ALLOCATIO N 


ABCA 


J 


ANUA 


RY 


ABBCDAEFC 


IMMEDIATE 


ABCA 


C 


ANVA 


S 


ABBCDAEFGAE 


ILLUMINATIN G 


ABCA 


CH 


APLA 


IN 


ABBCDAEFGAHE 


ILLUMINATION 


ABCA 


C 


APTA 


IN 


ABBCDAEFGAHE 


D ISSEMINAJION 


ABCA 




AREA 


r 


ABBCDBCEA 


APPRQjPRlA TE 


ABCA 


DEB 


ARKA 


TION 


ABBCDCA 


EFFICIE NT 


ABCA 


EMB 


ARKA 


TION 


ABBCDCA 


C OLLISIO N 


ABCA 




ASIA 




ABBCDCAED 


EFFICIENC Y 


ABCA 


CO 


ASTA 


L 


ABBCDCAED 


C OLLISIONS 


ABCA 


c 


ASUA 


L 


ABBCDCEFA 


ADDITIONA L 


ABCA 


c 


ASUA 


LTY 


ABBCDDCA 


C OMMISSIO N 


ABCA 




AVIA 


TOR 


ABBCDDCA 


C OMMISSIO NER 


ABCA 




BARB 


ED 


ABBCDDCEAFGC 


ACCOMMODATIO N 


ABCA 




BOMB 




ABBCDEA 


ACCOMPA NY 


ABCA 




BOMB 


ARD 


ABBCDEA 


APPROVA L 


ABCA 




BOMB 


ER 


ABBCDEA 


ASSOCIA TE 


ABCA 


LIGtfT 


BOMB 


ER 


ABBCDEA 


SH ELLFIRE 


ABCA 




BRIB 


E 


ABBCDEA 


T ERRIBLE 


ABCA 




BULB 




ABBCDEAFB 


ACCORDANC E 


ABCA 




CANC 


EL 


ABBPDEAFB 


REENFORCfe 


ABCA 




CHEC 


K 


ABBCDEAFBC 


ACCEPTANCE 


ABCA 




CIRC 


LE 


ABBCDEAFBGBC 


REENFORCEMEN T 


ABCA 




CIRC 


ULATE 


ABBCDEAFD 


APPLICATI ON 


ABCA 




CONC 


EAL 


ABBCDEAFEC 


ASSOCIATIO N 


ABCA 




CONC 


LUDE 


ABBCDEAFGC 


ACCEPTABLE 


ABCA 


HUN 


DRED 




ABBCDEAFGC 


ALLEGIANCE 


ABCA 


L 


EADE 


R 


ABBCDEAFGHF 


C ORRESPONDIN G 


ABCA 




EAGE 


R 


ABBCDEFGA 


ACCIDENTA L 


ABCA 


M 


EAGE 


R 


ABBCDEFGA 


APPROXIMA TE 


ABCA 


S 


EAME 


N 


ABBCDEFGA 


OCCUPATIO N 


ABCA 


ST 


EAME 


R 


ABBCDEFGBAHAC 


IRREGULARITIE S 


ABCA 


N 


EARE 


ST 


ABBCDEFGBA 


IRREGULARI TY 


ABCA 


C 


EASE 




ABBCDEFGEA 


ILLUSTRATI ON 


ABCA 


GR 


EASE 




ABBCDEFGHAD 


C OMMENDATION 


ABCA 


INCR 


EASE 


D 


ABCA 


P ACKA GE 


ABCA 


L 


EAVE 




ABCA 


EV ACUA TING 


ABCA 




ECHE 


LON 


ABCA 


EV ACUA TION 


ABCA 


WR 


ECKE 


D 


ABCA 


R ADIA L 


ABCA 


INF 


ECTE 


D 


ABCA 


R ADIA TE 


ABCA 




EDGE 




ABCA 


ADJA CENT 


ABCA 


S 


EIZE 




ABCA 


GR ADUA L 


ABCA 


R 


ELIE 


F 
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ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 
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ABCA 
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ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA 

ABCA’ 

ABCA 

ABCA 

ABCA 

ABCA 
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H 


ELBE R 


ABCA 


I 


NFAN TRY 


Tff 


ELVE 


ABCA 


CO 


NFIN E 


NOV 


EMBE R 


ABCA 


U 


NION 


ABS 


ENCE 


ABCA 


SU 


NKEN 


LIC 


ENSE 


ABCA 


FLA 


NKIN G 


C 


ENT# R 


abcA 


I 


NLAN D 




ENTE R 


ABCA 


I 


NTEN D 




ENVE LOP 


abcA 


CO 


NTIN UAL 


R 


EQUE ST 


ABCA 


CO 


NTIN UE 


FI 


ERCE 


ABCA 


I 


NVEN T 


S 


ERGE ANT 


ABCA 




OCTO BER 


MAT 


ERIE L 


ABCA 


D 


OCTO R 


REV 


ERSE 


ABCA 


F 


OGHO RN 


OBS 


ERVE 


ABCA 


P 


OISO N 


R 


ESPE CT 


ABCA 


C 


OMPO SED 


W 


ESTE RLY 


ABCA 


C 


ONVO Y 


W 


ESTE RN 


ABCA 


EN 


ORMO US 




ETHE 'ft “ 


ABCA 


EXPL 


OSIO N 


MAN 


EUVE R 


ABCA 




PUMP 


R 


EVIE W 


ABCA 




PURP OSE 




EXCE PT 


ABCA 


HA 


RBOR 




EXPE CT 


,ABCA 


AI 


RBOR NE 




EXPE ND 


ABCA 


MU 


RDER 




EXTE ND 4 


ABCA 


0 


RDER 




gaug’e 


ABCA 


0 


RDER S 




GEOG RAPHIt 


ABCA 




REAR 


FOR 


GING 


ABCA 




RECR UIT 


W 


HICH 


ABCA 


COU 


RIER 




HIGH 


ABCA 


P 


RIOR 




HIGH ER 


ABCA 


SUPE 


RIOR 




HIGH EST 


ABCA 


A 


RMOR 


V 


ICTI M 


ABCA 


A 


RMOR Y 


M 


IDNI GHT 


ABCA 


P 


ROGR AM 


DR 


IFTI NG 


ABCA 


MO 


RTAR 


L 


IFTI NG 


ABCA 


' QUA 


RTER 


S 


IGNI FY 


ABCA 


QUA 


RTER S 


BU 


ILDI NG 


ABCA 


FEB 


RUAR Y 




INDI CATE 


ABCA 


FO 


RWAR D 




INDI RECT 


ABCA 


CEN 


SORS HIP 


DESCR 


IPTI ON 


ABCA, 




SUNS ET 


L 


IQUI D 


ABCA 


IMPOR 


TANT 


A 


IRFI ELD 


ABCA 


S 


TART 




REPR ISAL 


ABCA 


PRO 


TECT 


M 


ISFI RE 


ABCA 




TENT 


F 


ISHI NG 


ABCA 




TENT H 


W 


ITHI N 


ABCA 


PRO 


TEST 


FUE 


LOIL 


ABCA 




TEXT 




MAIM 


ABCA 




THAT 


LA 


NDIN G 


|BCA 


S 


TRAT EGIC 
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ABCA 


S 


TRAT EGY 


ABCAC 


P 


ABCA 


D 


UGOU T 


ABCAC 


PRO 


ABCA 




UNSU ITABLE 


ABCACA 


D 


ABCA 


P 


URSU E 


ABCACB 


0 


ABCA 


P 


URSU IT 


ABCACBDEC 




ABCA 
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IM 


MI 


-R 


EP 


EL 


LE 


D- 












IM 


MI 



D- 

DI N- 

D- 

D- 

D- 

D- 

D- 

D- 

NC E- 

NT IA L- 

S- 

D- 

D- 

D- 

N- 

NG ER 

D- 

D- 

D- 

D- 

S- 

L— 

S- 

D- 

S- 

R- 

R- 

RE E- 

NG 

NG 

CU LT 
ON 
NG 
NG 

ME TE R- 

NG 

ON 

NG 

NG 

NG 

AM 

NG 

GR AN T- 
GR AT XO N~ 
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ftEOTRICTE B- ' 









AB 


BA 












AB 


— 


BA 












IM 


MI 


NE 


NT 






PR 


AC 


TI 


CA 


BL 


E- 






SW 


IM 


MI 


NG 








PR 


AC 


TI 


CA 


L- 






-B 


EG 


IN 


NI 


NG 








-T 


AC 


TI 


CA 


L- 








SP 


IN 


NI 


NG 






-D 


IV 


EB 


OM 


BE 


R- 








-W 


IN 


NI 


NG 










EN 


GI 


NE 


ER 








CL 


IP 


PI 


NG 








-G 


EN 


UI 


NE 










SH 


IP 


PI 


NG 






-I 


NT 


ER 


FE 


RE 








-S 


TR 


IP 


PI 


NG 






-I 


NT 


ER 


FE 


RE 


NC 


E- 








IR 


RI 


GA 


TI 


ON 


-P 


EN 


ET 


RA 


TE 










-M 


IS 


SI 


NG 








-R 


EV 


OL 


VE 


R— 








-M 


IS 


SI 


ON 










IN 


FI 


NI 


TE 






-A 


DM 


IS 


SI 


ON 








-D 


IS 


PO 


SI 


TI 


ON 






EM 


IS 


SI 


ON 








-S 


IT 


UA 


TI 


ON 








-H 


IS 


SI 


NG 








CA 


NA 


DI 


AN 








PE 


RM 


IS 


SI 


ON 






VE TE 


RI 


NA 


RI 


AN 






TR 


AN 


SM 


IS 


SI 


ON 








NI 


NE 


TE 


EN 










EM 


IT 


TI 


NG 








NI 


NE 


TE 


EN 


TH 








-F 


IT 


TI 


NG 










PE 


RC 


EP 


TI 


ON 




-S 


PL 


IT 


TI 


NG 








-P 


RE 


MI 


ER 








PE 


RM 


IT 


TI 


NG 






-S 


UR 


RE 


ND 


ER 






-A 


FT 


ER 


NO 


ON 








-0 


UR 


SE 


LV 


ES 








FO 


RE 


NO 


ON 








TH 


EM 


SE 


LV 


ES 












NO 


ON 


TI 


ME 






DE 


SE 


RV 


ES 










-F 


OL 


LO 


W- 








RE 


SE 


RV 


ES 










-H 


OL 


LO 


W- 










SE 


RV 


ES 










-C 


OM 


MO 


N- 
























-C 


OM 


MO 


TI 


ON 


















PO 


SI 


TI 


ON 


NO 


RT 


HO 


F- 


















-R 


EC 


ON 


NO 


IT 


ER 
























OP 


PO 


RT 


UN 


E- 






















OP 


PO 


RT 


UN 


IT Y- 






















OP 


PO 


SE 


























OP 


PO 


SI 


TE 
























OP 


PO 


SI 


TI 


ON 




















-C 


OR 


RO 


BO 


RA 


TE 




















-C 


OR 


RO 


DE 






















-T 


OM 


OR 


RO 


W- 
























-B 


OT 


TO 


M- 
























-C 


OT 


TO 


N- 
























CA 


RE 


ER 


























-S 


UC 


CU 


MB 


ED 
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AB 




— 


BA 


DE 


BA 


RK 


ED 




DE 


CL 


AR 


ED 




DE 


FE 


ND 


ED 




DE 


MA 


ND 


ED 




DE 


PA 


RT 


ED 




DE 


PL 


OY 


ED 




DE 


PO 


RT 


ED 




DE 


SE 


RT 


ED 




DE 


TA 


CH 


ED 




EC 


ED 


EN 


CE 




EM 


PL 


OY 


ME 


NT 


EN 


TR 


AI 


NE 


D- 


ME 


AS 


UR 


EM 


EN T- 


NE 


GL 


IG 


EN 


CE 


NO 


TA 


TI 


ON 




PA 


RA 


GR 


AP 


H- 


RE 


CE 


IV 


ER 




RE 


CO 


RD 


ER 




RE 


GI 


ST 


ER 




RE 


PE 


AT 


ER 




RE 


PO 


RT 


ER 




RE 


VO 


LV 


ER 




RO 


JE 


CT 


OR 




SE 


MB 


LI 


ES 





AB 


— 


— 




BA 




DE 


SE 


CR 


AT 


ED 






DE 


SI 


GN 


AT 


ED 






DE 


SP 


AT 


CH 


ED 






EN 


EM 


YP 


LA 


NE 


S- 


-D 


ET 


ER 


10 


RA 


TE 




— s 


EV 


EN 


TY 


FI 


VE 






IR 


RE 


GU 


LA 


RI 


TY 




NO 


MI 


NA 


TI 


ON 






SU 


sp 


IC 


10 


US 





AB — BA 

DE MO NS TR AT ED 
NO TI FI CA TI ON 
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RESTRICTED 




F. 



DIGRAPHIC IDIOMORPBS : FOUR-SQUARE 1 



(Grouped by number of significant letters in the xdiomorphic pattern) 



Two letters 











A- 


A- 










A- 


A- 










A- 


— 


— 


A- 








B 


LO 


CK 


AD 


ED 










SQ 


UA 


DR 


OH 






MO 


VE 


ME 


NT 










I 


HV 


AD 


ED 






FI 


GH 


TE 


RP 


LA 


NE 




E 


MP 


LA 


CE 


ME 


NT 










D 


AM 


AG 


E 






MO 


TO 


RI 


ZE 


D 






PE 


RS 


ON 


NE 


L 








CO 


MM 


AH 


DS 






D EP 


AR 


TU 


RE 








A 


RT 


IL 


LE 


RY 










I 


SL 


AH 


DS 










UN 


US 


UA 


L 




















A 


IR 


PL 


AH 


ES 






















A- 




— ■ 


-- 


A- 






E RE 


MX 


PL 


AH 


ES 








A- 


_ ~ 


A- 










CO 


MM 


UN 


IC 


AT 


10 


NS 


DE 


SI 


GN 


AT 


ED 






S 


AB 


w 


AG 


E 








CO 


NC 


EN 


TR 


AT 


E 




E 


ST 


IM 


AT 


ED 






D ET 


AC 


HM 


EN 


T 






R 


EO 


PG 


AH 


IZ 


AT 


10 


N 


I 


HD 


IC 


AT 


ED 






H 


AS 


BE 


EN 










LI 


EU 


TE 


NA 


NT 










C 


AV 


AL 


EY 






BA 


TO 


AL 


10 


H 




CO 


NS 


TR 


UC 


TI 


ON 










N 


AV 


AL 








BO 


MB 


ED 


























P 


RO 


CE 


DU 


EE 






CA 


SU 


AL 


TI 


ES 






A- 


-- 






— 


A- 








ME 


CH 


AH 


IZ 


ED 




CA 


su 


AL 


TY 








CO 


MM 


IS 


SI 


ON 


ED 






B1 


ME 


DI 


AT 


EL 


Y 




CO 


MB 


AT 


























HI 


TH 


DR 


AU 








CO 


OR 


DI 


HA 


TE 


S 








-B 


-B 










¥1 


TH 


DR 


EW 








DI 


RE 


CT 


10 


N 








UN 


AB 


LE 














EM 


ER 


GE 


NC Y 




DI 


SP 


AT 


CH 








OB 


ST 


AC 


LE 








L 


IE 


UT 


EH 


AH 


T 




ME 


DI 


UM 


BO 


MB 


ER 










AD 


VA 


NC 


E 










FI 


FT 


EE 


N 




DI 


VE 


BO 


MB 


ER 










AG 


AI 


NS 


T 










FI 


FT 


H 




R OA 


DJ 


UN 


CT 


10 


N 








R 


AI 


LH 


EA 


D 










FI 


FT 


Y 




R 


EP 


LA 


CE 


ME 


HT 






PR 


EP 


AR 


AT 


10 


N 






BR 


ID 


GE 


HE 


AD 




R 


ET 


RE 


AT 










A 


SS 


AU 


LT 












V 


IC 


IN 


IT 


Y 


S 


EV 


ER 


AL 










B 


OM 


BA 


RD 












¥ 


IT 


HD 


RA 


¥ 


JU 


NC 


TI 


ON 










A 


IR 


BO 


RN 


E 








A 


DD 


IT 


10 


HA 


L 


CO 


NF 


IR 


MA 


TI 


ON 






S 


EA 


BO 


RN 


E 






A 


MM 


UR 


IT 


10 


N 




I 


NF 


OR 


MA 


TI 


ON 




A 


DV 


AH 


Cl 


HG 










CO 


HD 


IT 


10 


H 




I 


HT 


EL 


LI 


GE 


NC 


E 






VI 


Cl 


NI 


TY 






RE 


CO 


GH 


IE 


10 


H 






PA 


TR 


OL 














DE 


TA 


CH 










E 


LE 


ME 


HT 






SA 


BO 


TA 


GE 












DE 


TA 


CH 


ME 


NT 








MI 


LI 


TA 


RY 




SE 


VE 


EE 










II 


AV 


EB 


EE 


N 












MI 


HI 


MU 


M 


AC 


TI 


VI 


TY 










M 


OV 


EM 


EN 


T 












HI 


NT 


H 




A 


TT 


EH 


TI 


OH 












EN 


EM 


Y 










P 


01 


NT 






s 


UC 


CE 


SS 


FU 


LL 


Y 






R 


ES 


ER 


VE 










T 


OM 


OR 


RO 


¥ 




















R 


ET 


UR 


N 












PO 


NT 


ON 






A- 


-- 


-- 


A- 












FL 


AN 


K 












RE 


QU 


ES 


T 




AR 


TI 


LL 


ER 


Y 










FO 


LL 


0¥ 












RE 


QU 


IR 


E 




AT 


TA 


CK 


ED 








B 


AG 


' rti. 


GE 












P 


RI 


SO 


HE 


R 


R 


EE 


NF 


OR 


CE 












HA 


SB 


EE 


N 








RE 


SI 


ST 


AH 


CE 


R 


EE 


NF 


OR 


CE 


ME 


NT 


A PP 


RO 


AC 


HI 


1\0 








D 


IS 


PO 


SI 


TI 


OH 






ID 


EH 


TI 


FY 






DE 


BO 


UC 


HI 


HG 












PO 


SI 


TI 


OH 






IM 


PA 


SS 


IB 


LE 




L 


AU 


NC 


HI 


NG 














SO 


UT 


H 






IM 


PO 


SS 


IB 


LE 




I 


MM 


ED 


IA 


TE 


LY 







1 See subpar, ; Section IX. 
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Two letters (cont.) 




-B -B 


-B -B 


-B B 


IN IT IA TE 


P ER SO NN EL 


I DE NT IF IC AT 10 N 


F IF TH 


ES TI MA TE DA T 


M EO HA NI ZE D 


TE RR IT OR Y 


P LA TO ON 


D EP LO YM EN T 


S IX TY 


S UP PL Y 


M ES SE NG ER 


M IS CE LL AN EO US 


S UP PO RT 


D ES TR OY ER 


E LE VA TI ON 


NA VA LB AS E 


A IR SU PP OR T 


E EE VE N 


F OR NA RD 


V IS IB IL IT Y 


LI AI SO N 


UI HD WA RD 


ME SS EN GE R 


DA MA GE 




I MP AS SI BL E 


MO RN IN G 


I 

* 


I MP OS SI BL E 


U NU SU AL 


C AS UA LT Y 


A NT IA IR CR AF T 


OB JE CT IV E 


P AT RO LS ' 


C OM MA ND IN G 


C OL ON 


B AT TL ES HI PS - 


OP ER AT 10 N 


C OL ON EL 


GE HE BA L 


PR IS ON ER 


SU PE KE OR IT Y 


W IL LA TT AC K 


PR OC ED UR E 


M OT OR IZ ED 


T RA NS MI SS !lO N 


RE EN FO RC E 


OU TS KI ET S 


R EC OG NI TI ON 


TR AN SP OR TA TI ON 


EQ UI PM EN T 


T RO OP SH IP 


YE ST ER DA Y 


A ^ RA GE 


RE GI ME NT 




B AR RA GE 1 


CA RR IE ES' 


-B -B 


AI RG RA PT 


MI SS 10 NS 


R EC OM ME IS3 ED 


AN TI AI RC RA ST 


OT EN TY 


HE AV YL OS SE S 


RE MA IN 


R EQ UE ST ED 


R EC OM MB ND AT 10 N 


R EQ UI RE ME NT 




C OM MU NI CA TI ON 


M IS SI NG 




R EC ON NO IT ER IN G 




Three letters 




A- A- A- 


A- A A- 


-B -B -B 


N AV AL BA SE 


EE QU ES TE|D 


B OM BA RD ME NT 


R EQ UI SI TI ON 




EL EM EN TS 






EN GA GE ME Iff 




Four letters 




AB A- -B 


A- AB -B 


A- -B AB 


H EA DQ UA RT ER S 


AD DI TI ON AL 


M OR HI NG 


EL EV EN 




P OS TP ON E 




A- AB B , 




AB -B A- 


SO UT m ES T 


A- -B -B — A- 


CA NC EL 




RE CO UN 01 TE R 


RE CO NN AI SS AN Cl 


A- A- -B -B | 






W IT HD RA NA ju 


A- -B — AB 


AB -B — A- 




3ZT TE RD IC T 


AD VA NC ED 


irjxesssn. 




Eli EM YP All KS 


CO HI AN DI I7G 


A- -B -- A- -B 






S AT IS FA CT OR Y 


AB — A- -B 


A- A B -B 




SI GH TI NC- 


RE QU IR EM iN T 


A A- C- C- 






| DI SP AT CH ES 
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REOTniGTED 

A C- A- C- 


Four letters (cont.) 
-B A- — AB 


-B -D -D -B 


RO AD JU NC TI ON 
-B AB A- 


U NS UC CE SS FU L 

i 

-B A A- -B 


AI RS UP PO RT 
-B -D D -B 


DI SP OS IT 10 U 
P OS H 10 N 
PR ES EN T 
EE PR ES EH % 

-B A- AB 


•HE DI jUM BO MB ER 

/ 

-B A- -- -B A- 
VI SI BI LI TY 

-B A AB 


IN ST RCJ CT 10 N 
C ON ST REJ CT 10 N 

-B — A- AB 
F IG BT ER PL AN ES 

-B A AB 

E ST AB LI SH MS NT 

-B B A- A- 

EN CO UN TE RE D 

-B B -D -D 

RE IN FO RC EM EN T 


RE PE AT ED 
-B A- A- -B 


IN FO BH AT 10 N 
-B A — A- -B 


DE ST RO 303 R 
-B A- -B — A- 


IN ST AL LA TI ON 
-B -D -B D 


UN ID EE TI FI ED 


CR OS SR OA DS 




/ 

Five letters 




A- -B AB — -B 


-B A- A AB 


-B -D D -B -D 


NA VA IA TT AC K 


DI ST RI BU TI ON 


IN ST HD CT 10 NS 


A- -B B AB 


-B A- -B AB 




R EC ON NA IS SA NC E 


RE PL AC EM EN T 





Six letters 



AB CB C- A- 


A- A- -B AB A- 


A CB A CB 


P OS IT 10 NS 
AB -D -D AB 


RE QU IS IT 10 N 
A- CB — A- CB 


ID EN TI FI CA TI ON 
-B AB AD -D 


C ON DI TI ON 
RA DI OG RA M 


Q UA RT ER MA ST ER 

A- CB — CB A- 
SC HO OL HO US E 


A DM IN IS TR AT IV E 



Seven letters 



-B AD B -D AD 

RE ED FO RC EM EN T 



Eight letters 



AB -B AD B AD 


AB -B C- AB CB 


AB -D C- AD C- -B 


QU AR TE RM AS TE R 


EM PL AC EM EN T 


IN TE RD IC TI OH 
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APPENDIX k 



, - , SERVICE TERMINOLOGY & STEREOTYPES , , ‘ *, , 

' — f t >4* -i “ v Jt -■'M ^ 

Familiarity with the style and peculiar phraseology which exist in 
military messages greatly facilitates the cryptanalytic recovery of the 
plain text of any such messages which have been encrypted. Thus# this 
appendix has been compiled to comprise notes on those idiosyncrasies 
present in military messages which are of particular interest end aid 
to the cryptanalyst. The notes which are applicable to the messages 
of all Services are grouped together in Section A, those which are 
applicable to messages of ground, naval, and air origin, respectively, 
constitute Sections B, C, and Dj those which apply to special types of 
messages, such as weather messages, are contained in Section E; and 
remarks on stereotypic beginnings and endings of messages comprise 
Section F. 



Although the notes contained herein derive primarily from U. S. 
military communications, many apply as well to the military communi- 
cations of other countries. At the very least, this appendix Indicates 
the types of information on message style and phraseology which, when 
known concerning the messages of any source, can be quite helpful in 
the cryptanalysis of such messages. 



r a 
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* 



• • * * \ 
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A* REMARKS APPLICABLE TC^ElSsAfi&E* ^ili-Q^^TICFS 



1* When mention is made of time in military messages, it is t con- 
ventionally specified in terras of the 2ii-hour clock system (ending at 
midnight), in which time is expressed as a group of four numerals. The 
first two numerals of the group denote the hour and the last two numer- 
als, the minute after the hour; for example, 6:00 AM appears as 0600 
and 6:00 PM appears as 1800. For any current month, the day may be indi- 
cated by prefixing the four-digit time group with a two-digit date group, 
indicating the day of the month; for example, 080600 denotes 6:00 AM on 
thfe 8th day of the month. In some Instances, a four-digit time group or 
Six-digit date-time group occurring in a message may be found with a 
literal suffix, giving rise to such groups as 1800Z, 080600Q, etc; this 
suffix may be any one of the letters A to I or K to Z end is a type of 
designator used in communications practices to refer to a particular one 
of the 24 time zones of the earth. 



2. Administrative messages in general often contain many sequences 
of numbers, brought about by numeiouB references to previous messages and 
to various Service regulations (among other items), reference generally 
being made on the basis of identifying serial numbers and dates which 
such items usually hear; specific illustrations of this fact appear in 
several of the succeeding paragraphs in this appendix. Furthermore, 
administrative messages contain references to items having literal desig- 
nations; to minimize errors in this connection such designations are 
often spelled out phonetically, by means of a’ phonetic alphabet, such as 
one' of the following: 



ABLE 


JIG 


SUGAR 


ALFA 


JULIETT 


SIERRA 


BAKER 


KING 


TARE 


BRAVO 


KILO 


TANGO 


CHARLIE 


LOVE 


UNCLE 


COCA 


LIMA 


UNION 


DOG 


MIKE 


VICTOR 


DELTA 


METRO 


VICTOR 


EASY 


NAN 


WILLIAM 


ECHO 


NECTAR 


WHISKEY 


FOX 


OBOE 


XRAY 


FOXTROT 


OSCAR 


EXTRA 


GEORGE 


PETER 


YOKE 


GOLF 


PAPA 


YANKEE 


HOW 


QUEEN 


ZEBRA 


I?0TEL 


Quebec 


ZULU 


ITEM 


ROGER 


\ « 


INDIA 


ROMEO 

' s " rtl ■"!>'» 




i. The messages 


of all Services exhibit a high content of 


abbrevi- 



atlons; for this reason, the following list of frequently-encountered 
Abbreviations is included: 



WAVY OFFICER RAMS 

FARM. .. .fleet admiral 

ADM admiral 

VADM. • • .vice admiral 
RADM....rear admiral 

C01 10. . . . commodore 
CAPT.... captain 

CDR. ... .commander 

LCDR....lt. commander 

LT. .... .lieutenant 

LTJG. • • .lleut . Jr. grade 
EIJS ensign 



ARMY. AIR OFFICER RAMS 

GEN. .... .general 

LTGEN. • *.lt. general 
MAJGEN. . .major general 
BRIGGEN.. brigadier general 

COL colonel 

LTC0L....lt. colonel 
MAJ. .... .major 
CAPT. ... .captain 
1ST LT... first lieutenant 
2ND LT... second licut-iant 
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PUNCTUATION 



MISCELLANEOUS 



CLN. . .colon 
CUA... comma 
PARA. .paragraph 
PAREN . parenthes is 
PD.... period 
RPT. ..repeat 



CG . . . .commanding general 

CO.... commanding officer 

COM. • .commander 

COMDT.comroandant 

DET. • .detachment 

ETA. ..estimated time of arrival 

ETD. * .estimated time of departure 

GMT. . .Greenwich mean time t 

^ i i *- 



HQ. * .head quarters 
LAT. .latitude 
LONG • longitude 
LTR.. letter 
MSG. .message 
NR... number 
RE. . .reference 
UR... your 



It * The Identity of the person origlpating a military message may 
appear as a signature at the epd of a Dies cage and the_ addressee * c identity 
may appear at the 'beginning; or either, or both, of these may be "burled/' 
in the middle of the message, set off by parentheses* If the signature 
is at the end of the messago,_lt may be preceded by STOP (or PERIOD or 
SIGNED, or both. The identification of the originator or addressee may 
consist merely of his command designation (e.g., COMMANDING GENERAL, 

FIRST ARM!) or it may consist of his name and rank, followed hy COMMANDING 
or some other appropriate amplifying data (e.g., in the Army, his branch 
of service). . ^ ^ > j i , j s j + ^ \ ^ v '" r ‘ , 



Examples ; 



(•£»—■* * 0**5 **■— ^ 



- i 






♦a t- f 
•» J'v, v — * " 



JONES, COLOJpL, ARTILLERY 



COMMANDING OFFICER, THIRD REGIMENT 
COMMANDER, DESTROYER SQUADRON SBC 
SMITH, FLIGHT I.EADER, SECOND SQUADRON 



5. In military communications, long messages are often broken into 
parts, each part subsequently being treated as a separate message. Thus, 
messages arise which begin "PART (number) OF (number) PARTS", or "(number) 
PART MESSAGE PART (number)", often separated from the following message 
text by STOP or simply by an "X". 



B. REMARKS ON ITEMS APPEARING IN GROUND (ARMY) MESSAGES 

T 

1. When mention of an army unit appears in a military message, its 
size (echelon) is Indicated, generally preceded by a numerical or literal 
designation and, as further information concerning the unit, its branch 
of service may be included. The several echelons of the U. S. Army, 
listed in descending order of size, ares army, corps, division (DIV), 
brigade, regiment (HEGT), battalion (BN), company 3 - (CO), platoon. Some 
of the branches of service which may appear, as mentioned above, are: 
Infantry (INF), Artillery (ARTY), Signal Corps (SIG C), Armor, Ordnance 
(ORD), Engineers (ENG), Quartermaster (QM). 

Examples of unit designations: 

(a) "A" Company, 39th Infantry Regiment, 9 th Infantry Division 



1 An artillery unit at this echelon is termed a battery . 

fc-3 
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(b) 1st Armored Division 

(c) 57th Signal Battalion 

2. In connection with 1, above, an army Is the normal comuond of a 
gencial (four stars); a corps being the command of a lieutenant gc.nej.-'l, 
a division, the command of a major general; and a brigade, the conn. and. of 
a brigadier general. A regiment is normally commanded by a colonel, a 
battalion may be commanded by a "lieutenant colonel or a major; a company, 
by a captain, and a platoon, by a lieutenant. 

3. For reference piirpones, when giving locations of units, readily- 

recognizable landmarks such as hills, crossroads Tyad road junctions may 
he referred to in terms of their altitude above sea level (in number of 
feat), if such landmarks do not bedr proper names which are suxtcblc fex 
the purpose. Thus, a reference, in a military message, to CR03SF0ADS 
SIX FIVE ZERO would apply to that particular crossroads within r pre- 
selected area which is located at an 'altitude^ of 65 b feet. If, within 
any preselected area of reference, there are two or more landmarks of any 
given type which are hoth at the carae^altityde, it is necessary to affix 
a distinctive letter or number to the r albitude designation of each, in 
order to clearly identify a particular one, thus, such a reference as 
CROSSROADS SIX FIVE ZERO DASH /hyphen/ B may be encountered. In this 
connection, CROSSROADS may be found abbreviated as "CR”, and ROADJURCTIOt'f 
as ’'RJ” • . _ 

it. 

4 . The location of any particular unit may be specified in tejms of 
its location with respect to a particular place or locality, or to a 
particular landmark. However, its location may also be specified by 
Gtating how it is located on a specific map or portion of a map. This 
gives rise in military messages to phrases such as COORDINATES ONE FIVE 
POINT TWO FOUR DASH ONE NINE FOUR POINT SEVEN, wherein the numbers before 
the "dash” indicate the unit's location with respect to the horizontal 
grid lines of a preselected map and the numbers after the "dash fl indicate 
its location with respect to the vertical grid lines. 

5 . Specific highways, turnpikes, and other roadways are often 

identified in military messages by stating the place names of their termi- 
nal points; thus the highway running between Grizurbeto and Bolzano could 
be called the GRIZURBETO DASH /hyphen/ BOLZANO HIGHWAY. Similarly, when 
reference is made to an imaginary straight line across the terrain m a 
poi ticular area, such a line may be identified by specifying any recog- 
nizable landmarks between which the line runs, for example, LIIU CROSSROADS 
THREE ONE FIVE DASH ROADJUNCTION TWO NINE EIGHT. „ , , 

6. Included below is a brief list of frequent words appearing in 
3ov-echelon ground traffic, the abbreviation for certain ones are appended 
in parentheses after them. In addition to the words listed below, nambeis 
and renke/titles will be found to have a high frequency of occuricnce. 

ACROSS ADVANCE AIRPLANE 

ACTIVITY ADVISE AMMUNITION ( M* 0) 

ADDITIONAL AFTERNOON AliFA 
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ARMORED 


_ HILL 


RADIO 


ARMY 


HOSTILE 


RAILROAD 


ARRIVE . 


IDENTIF ICATION ( IDENT) 


READY 


ARTILLERY (ARTY ) 


IMMEDIATELY 


RECEIVE 


ATTACK 


INFANTRY (INF) 


REC0N1TAISSANCE (RCN) 


BARRAGE 


INFORMATION (INFO) 


REinSRENCE (RE) (REF) 


B/A1TALJ0N (BN) 


LARGE 


REGIMENT (REGT) 


BRIDGE 


LEFT 


REINFORCEMENTS 


CAPTURE 


LIGltT 


RETORTS 


CASUALTIES 


LINE 


REQUEST 


COMMA 


LOCATION 


REQUIRE 


COMMAND POST (CP) 


MACHINEGUN (MG) 


REQUISITION 


CO^IUN JCATI ON (COMM) 


MESSAGE (MSG) 


RESERVES 


COMPANY (CO) 


MORNING 


RIGHT 


CONCENTRATION 


MORTAR 


RIVER 


COUNTERATTACK 


MOUNTAIN 


ROAD 


CROSSROADS (CR) 


MOVE 


ROAD JUNCTION (Rj) 


DAILY 


MOVEMENT 


ROCKET 


DASH 


NEAR 


SEND 


DEFEND 


, NEUTRALIZE 


SMALL 


DEFENSIVE 


.NIGHT 


SOUTH 


DISPOSITION 


NORTH 


STOP 


DIVISION (DIV) 


NOTHING 


SUPPLY 


EAST 


-OBJECTIVE 


SUPPORT 


EMPLACEMENTS 


OFFENSIVE 


TANKS _ . 


ENEMY 


(STICER 


TODAY 


ENLISTED PERSONNEL 


ORDER 


TOMORROW 


FIRE 


r PATROL 


TONIGHT 


FLANK 


PLANE , 


TROOPS 


FORCE 


POSITION 


VICINITY 


FROM (FM) 


PREPARE 


WEST 


HEADQUARTERS (HQ) 


PRISONER 


WOODS 


HEAVY 


'PROCEED 


YESTERDAY 



C. REMASKS ON ITEMS APPEARING IN NAVAL MESSAGES 

1# Mention of various sized groupings of vessels are found In 
messages of naval origin, among which those mentioned below are quite 
frequently encountered, A major naval force is called a fleet, and the 
levels of echelonment (or subdivision) within a fleet are the task force, 
task group, and task unit (in descending order of size). The basic unit 
of all fleet vessels^ is termed a division, and is comprised of two or 
more vessels of the "same type;* in this connection, when mention is made 
of a division in a naval message, the particular type of vessel df whlcli _ 
the division is made tip is often specified; for example, CRUISER DIVISION, 
A squadron is made up of two or more divisions of submarines, destroyers, 
landing ships or other light vessels, and a flotilla comprises two or 
more such squadrons. 
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2* In connection with 1, above, a fleet is normally coamnadcd by rn. 
admiral (four stars), a tack force being the command of a vice admiral; 
and a task group, the command of a rear admiral* Furthermore, in time of 
war the officer In command of a convoy or flotilla often holds the rank of 
commodore; the officer commanding an Individual chip may range In rank 
from captain on down, depending on the type of ship. 

3* A list of the main combat vessels is Included below, the approx- 
imate maximum speed of each, which is expressed in KNOTS, is shown in 
parentheses - 1 



BATTLESHIP 

CARRIER 

CARRIER ESCORT 
CRUISER 
DESTROYER 
DESTROYER ESCORT 
SUBMARINE 



(35) 

(35) 

(15) 

(30) 

(35) 

(25) 

( 20 , on surface; 
10, submerged) 



When a particular vessel is mentioned in a naval message, it may be identi- 
fied by a numerical designation, by a group of letters, or by some proper 
name. 



4. In naval messages, the direction of an Object from a ship, or the 
course of a particular naval vessel or unit at sea io given as a compass 
hearing expressed in degrees (from 0 to 359)* for example, BEARING ZERO 
EIGHT FIVE* In some instances the statement of a bearing will be followed 
by the word TRUE or MAGNETIC, indicating that the bearing is measured from 
the geographical pole (true north) or the magnetic pole (not corrected for 
variation), respectively. 

5* The position of a particular naval vessel or unit at sea may be 
specified in a naval message by stating its latitude and longitude in 
degreoa and minutes. The latitude may bo from 0 to 90 degrees and the 
longitude from 0 to 180 degrees, a specified latitude is generally followed 
by NORTH or SOUTH (as appropriate) and, similarly, longitude is followed by 
EAST or UEST. For example, LATITUDE THREE ZERO DEGREES TOO ONE MINUTES 
NORTH LONGITUDE ONE FOUR TOO DEGREES ONE SIX MINUTES TOST, or more briefly 
LATITUDE THREE ZERO DASH TOO ONE NORTH LONGITUDE ONE FOUR. TOO DASH ONE SIX 
TOST* If -position is stated in conjunction with a bearing, it is not 
necessary 'to' state both latitude and longitude, and the location, NORTH or 
SOUTH, with respect to the equator or EAST or TOST with respect to Green- 
wich Meridian may be' found omitted where no ambiguity arises. Positions 




| > ■=* i*% f * * I 1 
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6. The following words may he expected to appear frequently in 
a selection of naval messages of various types: 



AIRCRAFT 


EXECUTE 


RADAR .. 


ALTITUDE 


FLEET 


REJOIN ‘ 


BEACH 


' ■"* T FLIGHT 


REJIDEZVOUS 


BLOCKADE 


, FORMATION 


SAiL 


BCH3ED 


KIEL 


A SEA 


CARGO 


GUARDING 


SHIFT 


CHANNEL 


HARBOR 


SHIP , , 


COASTAL 


KNOTS 


SORT! 


ccrmHD 


LATITUDE 


SQUADRON 


CONTACT 


LONGITUDE 


^ STARBOARD 


CONVOY 


MILES 


STRAFED 


CORRECTED 


MINE (FIELD) 


STRAIT 


COURSE 


MISSION 


^ TARGE? 


CRAFT 


NAVAL 


VESSELS 


DEPART 


NAVY 


VIA 


DEPLOY 


OPERATIONS 


VOYAGE ( 


EMBARK 


PILOT 


WEATHER 


ESCORT 


PORT 





D. REMARKS ON ITEMS APPEARING IN AIR MESSAGES , _ 

1. The various elements of which an air force Is composed and which 
■may he mentioned. in air messages are given below, The smallest 'grouping 
of aircraft, composed of one or more aircraft of a particular type, is 
called a flight « A squadron is two or more flights of the same type; a 
group Is made up of two or more squadrons, a wing comprises two or more 
groups, an air division is composed of two or more wings, and two or idore 
divisions constitute an a?r force. 

■■■ i __ d X 

2. In connection with 1, above, a flight is usually commanded \>y a “ 
major, a squadron being the command of a lieut. colonel,^ a group being 
the command of a colonel, a "wing, _the commancf of a brigadier general;" 
and an air force, the command "of a major general. 

3. Some of the types of aircraft which_ may be mentioned frequently 
in air messages are listed below, an Indication of the range of speed of 
each type, expressed in KNOTS, is shown in parentheses. 



BCJ1BER 
CARGO PLANE 
FIGHTER 
JET EC23BSR 
JET FIGHTER 
LIAISON PLANE 



(250-400) 
(150-350) 
( 250 - 500 ) 
(350-600) 
(250-500) 
( 65-150) 



4. The position of a particular aircraft may be specified In an 
air message by stating its latitude and longitude in degrees and minutes, 
sometimes in conjunction with its altitude in feet. (Latitude may be 
from 0 to 90 degrees and longitude from 0 to 180 degrees.) 



1 1 * 
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1. Weather messages . Any generalization on the specific elements 
which a weather message will contain would perforce be rather tenuous, 
the contents of a particular weather message generally being dependent 
on its source and purpose. However, there are certain elements which 
may be expected.' to appear in most weather messages, these are listed 
below with an Indication of the terms in which each is generally expres- 
sed: 

- ■ t ' 

a. Identification of the originating station (by code number, 

or location). T 

b. ‘ Wind speed (knots) and direction (tens of degrees, from 

00 - 36 ). 

c. Amount of r low clouds (tenths of sky covered) and their 
height (hundreds of, feet). 

d. Types of low, medium, and high clouds (e.g., cumulus, 
stratus, cirrus, etc.). 

e. Temperature of the.a^r and temperature of the dew point 

(both in degrees Fahrenheit). 

* 

"f. ""present and past weather (e.g., clear, partly cloudy, 
cloudy or ovArcast, fog, drizzle, rain, snow, showers, thunderstorm, 
etc.). 

g. Horizontal visibility (miles). 

h. Atmospheric pressure (tens, units, and tenths digits in 
millibars) and barometric tendency (e.g., falling, steady, rising, etc.) 

2. Air-to-ground position reports . Position reports made by air- 
craft in flight may be expected to contain the majority of the following 
elements of information: 

r * - 

a. Position of the aircraft (in latitude and longitude or with 
respect to some locality on the ground). 

b. Time. 

c. Speed. 

d. Altitude. 

e . Weather conditions . 

t '. Estimated time of arrival at next reporting point or’ at 
destination. r , ^ „ 
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*- - * -r ^ ^ 

F. STEREOTYPIC BEGINNINGS AND ENDINGS 



Within the confines of the comparatively limited scope of military 
messages, stereotypy of phraseology is inevitable^ Particularly In the 
beginnings of messages is this limitation apparent: thus these positions 

lend themselves most readily to attack by the cryptanalyst. The follow- 
ing Hat of stereotypes have a high frequency of positional occurrence, 
and therefore may provide a fruitful source for cribs. It Is to be noted 
that a stereotypic initial word often may suggest a whole opening phrase. 
For example, if a message of low-echelon ground origin begins with the 
word HEAVY, then it Is not too unlikely that the opening phrase Is "HEAVY 
ARTILLERY (FIRE, BARRAGE) (FALLING, INTERDICTING)....," which might be 
expanded into "HEAVY ARTILLERY FIRE FALLING ON ODR POSITIONS (NORTH, EAST, 
SOUTH, WEST) OF...." 



BEGINNINGS 



ACKNOWLEDGE 

ADVANCE 

ADVISE (THIS COMMAND) 

(THIS HEADQUARTERS) 

ARRIVE 

ATTACK 

ATTENTION 

CANCEL 

CITE 

CO:CJAIiDING (GENERAL) 

CCL-IUNICATIGN (OFFICER) 

CONCENTRATE (-ION OF) 

CONFIRM 

CONTINUE 

DEPART (-URE) 

DISCONTINUE 

EFFECTIVE 

ENEMY 

EQUIPMENT 

EXPEDITE (SHIPMENT) 
FOLLOWING (ARE) (IS) 

FOR 

FROM 

HEAVY 

HOSTILE 

INFORM (-ATION) 

IN REPLY (TO YOUR) (MESSAGE) 
LOCATION (OF) 



NUMBERS (1, 1st, 2, 2d, etc.) 

ORDERS 

OUR 

PARAPHRASE 

PLEASE 

POSITION 

PREPARE (TO) (-ATIONS FOR) 

PROCEED 
* RECEIPT 
RECEIVE 

RECOMMEND (-ATION) (-ED) 

REFER (-RING) (TO) (YOUR) 

REFERENCE (YOUR, MY) (MESSAGE, RADIO- 
GRAM, TELEGRAM) (NUMBER) (DATED, OF) 
REPEAT 
REPORT 
REQUEST 
REQUIRE 
RERAD 
REURAD 
SEND 

SITUATION REPORT 
STATUS REPORT 
SUPPLY 
VERIFY 

YOUR (COMMAND) (ORGANIZATION ) 



ENDINGS 



ACKNOWLEDGE 
ADVISE (IMMEDIATELY) 

CONFIRM 

END 

END OF MESSAGE 
IMMEDIATELY 

NUMBERS (l, 1st, 2, 2d, etc.) 



PERIOD 
REPLY 
REFERENCE 
REQUESTED 
SIGNED (NAME) 

STOP 

TITLES (MAJ, COL, etc.) 
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APPENDIX 5 

LETTER FREQUENCY DATA - FOREIGN LANGUAGES 



* rt^r* 1 •' J 

■" I (i M ' 

— MW ' 

Section Page£ 

A* German letter frequency data....... 3-6 

B. French letter frequency data...... 7-10 

C. Italian letter frequency data Ii-l4 

D. Spanish letter frequency data. 15-18 

E. Portuguese letter frequency data. 19-22 

F. Russian letter frequency data. 23-26 

1 ** r 
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She letter frequency data contained herein has been compiled from 
selected newspaper and magazine articles comprising war communiques and 
other military- type text. In the material which was processed there 
were place names and words foreign to each particular language, these 
words account for the presence <of certain non-characteristic letters in 
the data given herein for those languages which make use of the Roman 
alphabet* 
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A. GERMAN LETTER FREQUENCY DATA 



1-a, Absolute freque n cies of single letters of German plain text, orrcmged 
alphabetically, based o^60j6>i6 lotters ol* text ., (The letCern X and Y are' ' 
derived from foreign words contained in Geraan plaint text). 



A 


3,601 


G 


1,921 


L 


1,988 


Q 


6 


V 


523 


B 


1,023 


H 


2,477 


M 


1,360 


R 


4,339 


W 


099 


C 


1,620 


I 


4,879 


N 


6,336 


S 


4,127 


X 


12 


D 


3,248 


J 


192 


0 


1,635 


T 


3,447 


Y 


24 


E 


10,778 


K 


747 


P 


499 


U 


2,753 


Z 


654 


F 


958 

















60,046 



1-b. Monographic kappa plain, German language ■ .0787 



1-c • Frequency distribution of single letters based on 60,046 letters of 
German plain text, reduced to ] ,000 letters, arranged according to their 
relative frequencies. T 



E 


180 


T 


57 


G 


32 


F 


16 


N 


106 


D 


54 


0 


27 


W 


15 


I 


81 


U 


46 


C 


27 


K 


13 


R 


72 


H 


4l 


M 


23 


Z 


11 


S 


69 


L 


33 


B 


17 


V 


9 


A 


60 








' 







8 

3 



_ . 1,000 

1-d . Percentage of vowels, high-frequency consonants, medium frequency con - 
sonants, and low-frequency consonants in 60 ,04 6 letters of German plain text . 
Percentage of 8 most frequent letters in German plain text I ~ 

Vowels A,E,%0’u,' ’mEl Y = 39-4# 

High-Frequency Consonants D,N,$,S, and T ■ 35*8# 

Medium^FrequePc y Consonants #,C,F,G,H,L,M, and W * 20.4# 

Low r Frequency Consonants J,K,P,Q,V,X, and Z • 4.4# 

1 I I ■* I I l • 

» (In descending order Of frequency) 

v V m t > f 

8 most frequent letters E,N,'l,R,S, A,T, and D = 67 . 9 # 

1-e. Absolute frequencies of single letters as initial letters of 9.568 
words in German plain text, arranged according to their frequencies . 



D 


1,716 


U 


550 


z 


343 


K 


263 


A 


762 


N 


544 


M 


339 


P 


181 


S 


698 


G 


46l 


N 


306 


R 


167 


E 


686 


B 


46o 


F 


280 


L 


158 


I 


581 


V 


4o8 


H 


265 


J 


135 



135 

106 

22 

2 



9355 



5-3 








REF ID:A56892 



2-a . Frequency distribution of dlrrupho baaed on 60,0h6 lettera of 
German plain text, reduced to 5,000 dierapho * 

BCDBFOBXJKLHHOPQRBTUVVXY 



Z 
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2~b . Digraphic kappa plain, German language « .0111 



2-e . The 95 digraphs comprising 75$ of German plain text, based on the 
table of ,000 digraphs (item 2~a)Y atTa'D^d aceQrding to their relative 
frequencies . 



m 


235 


RE 


60 


NA 


39 


ED 


ER 


195 


Dl 


60 


LI 


37 


SI 


CH 


130 


HE 


58 


UE 


37 


HA 


DE 


127 


NG 


57 


RI 


37 


DA 


HD 


118 


ST 


57 


AU 


36 


EL 


IE 


93 


SC 


55 


NS 


36 


US 


El 


90 


IS 


53 


NI 


35 


hT 


TE 


88 


BE 


48 


RD 


35 


AS 


III 


84 


AN 


48 


m 


34- 


LE 


GE 


7 8 i 


SE 


46 


AG 


33 0 


NT 




1235 


IT 


44 




2508 2 


ZU 


UN 


76 


SS 


40 


ON 


33 


LA 


ES 


68 


TI 


40 


AL 


33 


ME 


HE 


64 


IC 


39 


EG 


32 


HU 



31 


TA 


25 


HR 


20 ' 


H V 

TU 


16 


30 


EM 


25 


LL 


20 


WA 


16 


29 


EH 


24 


VE 


19 


UF 


15 


29 


EU 


24 


RO 


19 


FE 


15 


28 


WE 


24 


OR 


18 


EW 


15 


28 


HT 


23 


UR 


18 


AB 


l4 


20 


AT 


23 


NN 


18 


HI 


14 


27 


AR 


22 


RT 


18 


TR 


14 


27 


ES 


22 


OL 


18 


SA 


l4 


27 


EB 


22 


IG 


18 


MI 


14 


27 


VO 


23 


NW 


17 


HZ 


14- 


26 


NU 


20 


TD 


17 


UT 


14 ' 


26 


WI 


20 


MA 


16 


SD 


13 


26 


TS 


20 


SO 


16 




3750 



— V \ \ *■ 

2-d . Frequent digraphs la German plain text whose reversals are also fre - 

? uent, accompanied by their frequencies from the table of 3,000' fl*qrapho 
Item 2-a) . 

/ rr r -r m 



EN 235 ! HE 58 
ER 195 ! RE 60 
DE 127 | ED 31 



IE 


911 


El 


90 


ES 


68 j 


SE 


46 


AN 48 j 


NA 


IN 


84 


NI 


35 


IS 


531 


SI 


30 


IT 44 | 


TI 


GE 


78| 


EG 


32 










- 





39 

40 



2-e . Frequent digraphs In German plain text whoso reversals are rare or In - 
frequent^ accompanied by their frequencies from the tabic of 5 <000 digraphs 
(item 2-aT7 - - j 

r " “*■ ** “v \ -«-»■» 

CH 130 j HC 0 | ND 113 I DN 2 | NO 57 | ON 3 | SC 55 j CS 0 

+ C if 1 



2-f. Doublets occurring In German plain text,. arranged according to their 
frequencies from the table of 5 , 000 digraphs (Item 2-a) . 7 I " 



SS 


40 


« 

EE 


13 


FF 


7 


RR 


6 1 


gg’ 


\ 

2 


PP \ 2 1 


9 * * 

00 . 1 


LL 


20 


MM 


11 


TT 


7 


AA 


4 


11 


2 


HH ( 1 4 | 


UU 1 "1 


NN 


18 


DD 


8 














-j ^ 


* -J - 



2~g. The 22 digraphs appearing 100 or more times as ^beginnings of words ip 
9,568 words in German plain text, arranged according to their- absolute fre- 
quencies . , - 



D$ 805 
DI 567 
UN 428 
AU 318 



El 


300 


DA 


*244 


WE 


GE 


299 


VO 


214 


VE 


BE 


252 


SI 


197 


WI 



192 


ER 


153 


m 


124 


ST 


- - * 
112 


172 


HA 


l4o 


MI 


117 


IN 


111 f 


155 


AL 


134 


SN 


112 


SE, 


Ws 

*•’* | I « 4 > 



1 

2 



— " * V ♦ 1 , „ _ 

The 10 digraphs above this line compose 25$ of German plain text/ 

The 37 digraphs above this line compose 50$ of German plain text. 



3 -a. The 102 trigraphs appearing 60,046 letter? of 

Gorman plain text, arranged according to their 1 absolute frequencies , 



3-h. 



sen 666 


ERL’ 313 


HEN 198 


AUS 162 


1ST 1*42 


HRE 12*1 


rm 108 


DER 602 


EN 8 270 


SSE 19JL 


T1S 159 


STA l4l 


UER 122 


use 107 


CHE 599 


CRT 26-4 


ESI 190 


DER 157 


DES 1*40 


ACH 119 


EM 306 


DIE 56*1 


KGE 263 


TER 180 


EHI 157 


FUE 139 


GES 118 


ERG 106 


NDE 5*4l 


HDI 259 


REN 185 


ENG 155 


NTE 139 


ABE 117 


RIT 106 


EIN 519 


2ND 254 


EIT 184 


ION 15)4- 


UER 138 


ERA 117 


EHR 105 


END 481 


ERD 248 


EBE 178 


SEN 152 


EKU 137 


BFN 116 


CEA 104 


DEN 457 


IRE -247 


FNE 175 


ITI 151 


TUN 136 


lusir ht 


VON 104 


ICH 453 


AND 246 


LIC 175 


AUF 149 


SEI 133 


RLE 112 


sic 103 


TEN 425 


RDE 239 


EGE 173 


IES 149 


ESE 132 


VER 110 


IGE \Q2 


UNG 377 


ENA 21*1- 


DAS 172 


ASS l*l 8 


ERT 128 


LAN 109 


HE 101 


HEN 332 


EES 212 


ENG 171 


EH* 148 


KDA 127 


EBB 108 


ENZ 100 


UKD 331 


EDE 209 


NUN 169 


ENT 1*46 


IED 126 


ESS 108 


ERB 100 


GEN 321 


STE 205 


NCR 166 


ERI 143 


ERIT 125 


LLE 108 


EUT 100 


ISC 317 


VER 204 


KUIT 163 


EST 142 






- 


The 25 trigraphs 


appearing 


; 50 or more times 


as begirmlngs of words 



frequencies , 



EIR 


2*42 


DAS 


79 


SCH 


73 


AUF 


64 


DEU 


6 l 


UNT 


57 


UEB 


53 


VER 


170 


BRI 


79 


AUS 


69 


NER 


63 


GES 


60 


GRO 


56 


roL 


52 


FUE 


89 


DIE 


76 


SEI 


68 


IUD 


62 


GEG 


59 


AUC 


55 


W1R 


51 


SIC 


86 


NIC 


73 


STA 


85 


ALL 


61 















4 . The 121 totr n qraphs appearing 50 or more times In 60,046 letters of 
German plain text, arranged according to their absolute frequencies . 




SCIIG 


398 


TSCH 107 


ENIN 


80 


RITI 


66 


HERD 


6 l 


ISCII 


317 


RUUD 106 


NICE 


80 


ATIO 


65 


RSCH 


60 


CKEN 


296 


ms 10*4 


UNGD 


80 


GEND 


65 


EDEN 


59 


NDER 


243 


sich 103 


EITE 


79 


TEND 


65 


ERGE 


59 


EIRE 


218 


RUNG 


101 


DEUT 


78 


EBER 


64 


ESSE 


59 


ENDE 


216 


ANDE 


100 


FUER 


78 


GEGE 


64 


UNTE 


59 


ndie 


176 


UNGE 


100 


CUTE 


77 


FDLI 


64 


EICH 


58 


LICH 


168 


EREI 


9*4 


EGEN 


76 


SIND 


64 


TLIC 


58 


ICHT 


151 


TION 


93 


HEIN 


76 


TUNG 


64 


INER 


57 


TISC 


146 


SEIN 


92 


iESE 


75 


FKSI 


62 


EBEN 


56 


ERDE 


14*4 


IEDE 


91 


ERST 


74 


FUTS 


62 


ENDA 


56 


ENDI 


l4i 


LAND 


91 


RDLE 


7*4 


LITI 


62 


ENST 


56 


NDEN 


136 


SSEN 


90 


ERDI 


72 


UEBE 


62 


IGEN 


56 


RDEN 


133 


BRIT 


89 


STLN 


72 


UTSC 


62 


ONDE 


56 


ENTJN 


120 


DASS 


86 


CI3ER 


71 


AUCH 


61 


TENS 


56 


ICHE 


120 


HTER 


86 


INDI 


71 


DENS 


61 


EDIE 


55 


IHDE 


111 


EDER 


83 


REIN 


71 


EIND 


61 


ERTE 


55 


KGEN 


110 


EREN 


83 


HERE 


70 


Oi.IT 


61 


BEEN 


55 


ERUlf 


109 


EKGE 


81 


NGDE 


70 


SOHA 


61 


TDIE 


55 


DIES 


108 


ENAU 


80 


ENBE 


68 


SCHL 


61 


ATEN 


5*1 



A verage length of words in German plain text = 6.3 

5-6 

-l s 



DIEB 

EliZU 

ITEN 

KRIE 

RIEG 

SDEE 

URCH 

ALLE 

DERS 

EKWE 

HABE 

OKEN 

SCHI 

DEED 

DISC 

E11EN 

IIACH 

EDAS 

TRIGS 

AVEN 

KBER 



54 

54 

54 

54 

54 

54 

53 

52 

52 

52 

52 

52 

52 

51 

51 

51 

51 

51 

51 

50 

50 
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B. FRENCH LETTER FREQUENCY DATA 



1-n. Absolute frequencies of single letters of French plain text, arranged 
alphabetically, based on 55,758 letters of text J 



A 


4,480 


G 


624 


i 

L 


’ 2,737 


Q 


6 l 6 


4 

V 


801 


B 


4o6 


H 


276 


M 


1,617 


R 


4,117 


w 


6 


C 


1,944 


I 


4,230 


N 


4,406 


S 


4,564 


X 


317 


m 


2,198 


J 


184 


0 


3,255 


T 


4,057 


Y 


100 


E 

F 


9,334 

646 


K 


25 


P 


1,689 


U 


3,045 


Z 


84 



1-b. Moxxographlc kappa plain, French language - .0777 

l-c . Frequency distribution of single letters based on 55.758 letters In 
French plain text reduced to 1,000 letters, m .cl arranged according to their 
frequencies. , ‘ “ " * r * 



E 


167 


T 


73 


c 


35 


G 


‘ 11 


J 


3 


S 


82 


0 


58 


p 


30 


Q 


11 


Y 


2 


A 


80 


'U 


55 


M 


29 


B 


7 


Z 


2 


N 


~79 


L 


49 


V 


l4 


X 


6 


K 


1 


I' 


76 


D 


39 


F 


12 




5 


W 


- 


R 


74 














- 


1,000 



1-d. Percentage of vowels, high-frequency consonants, med i um-frequency eon- 
sonants, and low-frequcncy consonants in 55,75# letters of Frenhh plain text . 
Percentage of 8 most frequent letters in French plain text . ~ 

Voxels A,E,I,0,U, andY = 43.8$ 

High-Frequency Consonants N,R,S, and T = 30.7$ 

Medium-Frequency Consonants C and P ■ 18.3$ 

Low-Fr^queniJ- Consonants B,FjG,H, J,K,Q,V,W,X, and Z = J.2 

• ~ ‘ ! 2 i ? . - 1 * ' * / ' V * v * - 

(in descending order of frequency) 

• i 1 % 4- 1 J ' - j - 

8 most frequent letters* E,S,A,N,I,R,T, and 0 *’68.9$ 

» ■* T " < < ' l _ , 

- f 

1-e. Absolute 11 frequencies of single letters as initial letters of JLO, 748 
•words in Frenclx plain text, arranged according to their frequencies. jOne - 
letter words have been omitted) . 



D 


1445 


L 


784 


I 


315 


U 


240 


P 


929 


S 


664 


F 


313 


0 


177 


E 


894 


Q 


394 


T 


305 


G 


146 


A 


866 


R 


389 


N 


278 


B 


115 


C 


816 


M 


337 


V 


263 


J 


98 



9,^53 
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2-a. Frequency distribution of dlgrapho baced on 55,758 letters of 
French plain text, reduced, to 5,000 digraphs* — - 





A 


B 


C 


D 


E 


F 


G 


H 


I 


J 


K 


2<J L«t*<r 

L M N 


0 


p 


Q 


R 


s 


T 


U 


V 


w 


X 


Y 


z 


A 


* 


(> 


20 


12 


4- 


6 


II 




SO 


1 




36 


12 


68 


1 


2| 


3 


41 


m 


46 


a 


13 


B 




2 


• 


B 


B 








4 








4 






12 






4 




i 


5 


2 


1 


2 


B 


a 


B 


B 


B 


C 


B 




6 




47 






II 


io 






s 






48 




E 


4 


1 


8 


8 






B 


B 


B 


D 


18 






1 


101 






1 


20 


1 






1 




to 


1 




6 


2 




26 










B 


E 


•30 


4 


41 


48 


30 




14 


3 


13 


5 




56 


58 


105 


4 


38 


□ 


81 


154 

_ .. 


58 

.. . 


27 






8 




a 


F 


IO 




2 


1 


1 


6 






8 






1 






8 


1 


B 


10 


1 




1 












G 


6 








16 




1 




2 






3 


1 


7 


6 


B 


B 






4 


2 




B 


fl 






H 


6 








6 




i 




4 






■ 






3 






1 






4 




■ 


B 






I 


1 


•5 


12 


10 


41 


4 


□ 






1 




m 


8 


4? 


51 


5 


12 


27 


S? 


47 




1 




7 




i 


J 


B 








6 




















5 












B 












K 






























1 












B 












* L 


57 




1 


s 


IS 


1 




1 


23 






m 




3 


lo 


fl 






B 


□ 


12 




B 




I 


B 


§M 


12 


1 


1 


1 


52 








23 








13 




8 


□ 


■ 




B 


B 


4 




i 


B 


| 


B 


*Ln 


»1 


1 


21 


40 


54 


1 


li 


1 


20 


D 


B 


B 


B 


10 


11 


□ 


Q 


Q 


E3 


Q 


4 


7 




B 


| 


B 


0 




5 


7 


■3 


1 


» 


2 


1 


21 


0 


i 


m 


i 


101 




B 




27 


13 


3 


52 


2 


B 


B 


n 


B 


p 


"SO 




1 


1 


m 






2 


3 






ii 


B 




3S 


B 




34 


I 


6 


4 


- 


■ 








Q 






1 


























B 










54 












R 


62 


2. 


10 


13 


127 


2 


6 




24 


I 




16 


II 


,8 


27 


S 


3 


7 


H 


11 


6 


7 








i 


3 


4-1 


2 


16 


32 


75 


5 


2 


1 


36 


2 




m 


8 


6 


22 





il 


8 


□ 


33 


24 


4 






i 




T 


40 


1 


7 


22 


78 


4 


1 


2 


67 


1 




12 


4 


4 


Q 


□ 


7 


44 




ID 


II 


2 










U 


,»2- 


3 


|o 


5 


31 


4 


3 


1 


24 


3 




13 


6 




| 


B 


l 


48 


26 


11 


1 


8 




13 


i 




V 


1 








24 








16 










* 


Q 


B 




5 






2 












W 






























B 








B 


B 




fl 


B 


B 






X 


4 




3 


3 


•3 






1 


1 




■ 




i 


,i 


B 


B 


B 


B 


B 


B 




B 


B 


B 






Y 


2. 








B 


I 


H 








■ 






B 


0 


B 




B 


Jg 


B 


B 


B 


B 


B 




fl 


Z 








— — 


i 


i 


s 




1 




B 






■ 


B 


E 




B 




■ 


B 


E 


B 


3 




B 
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2-1). Digraphic kappa plain, French language = .0093 



2-c . The Of digraphs comprising 75$ of French plain text, based on the 
tab le of 9 t 000 digraphs T^tom 2-a) , arranged according to their relative 
fi cqucncicq . 



.ES 


■-154 


1,237} 


EC 


49 


I HD 


40j EE 


30 


SP 


24 


HI 


20 


RE* 


127 


sr 


' 58 


nr 


40 


2,5151 


HC 


29 


su 


24 


DT 


20 


OH 


'109 


EM 


58 


ED 


UQ 


TA 


40 


AU 


29 


RI 


24 


Cl 


20 


'W 


109 


LA 


57 


CO 


48 


UE 


39 


IR 


27 


VE 


24 


AC 


20 




105 


EL 


56 


UR 


48 


EP 


38 


EU 


27 


TS 


23 


UT 


19 


HT 


99 


QD 


54 


CE 


47 


AL 


3<5 


IL 


27 


MI 


23 


HO 


19 


LE 


95 


HE 


54 


IT 


47 


SI 


36 


RO 


27 


LI 


23 


RT 


19 


ER 


89 


HS 


53 


AT 


46 


PO 


35 


OR 


27 


SO 


22 


HA 


19 


TE 


”78 


ME 


52 


TR 


44 


PR 


34 


DU 


26 


m 


22 


DA 


18 


SE 


75 


IS 


52 


SA 


42 


ST 


33 


LL 


26 


TD 


22 


AS 


17 


AH 


68 


OU 


52 


IE 


4l 


SD 


32 


US 


26 


AP 


21 


EV 


17 


TI 


67 


10 


51 


AR 


4l 


PA 


30 


UN 


26 


01 


21 






RA 


62 


AI 


50 


SS 


4l 


EA 


30 


UI 


24 


OH 


21 


I 3,751 



2~d . Frequent digraphs in French plain tort tlioso reversals a rc also , 
f requent, acconpanicd hy their frequencies fx~on the table of 9 >000 digraphs 
Xltea 2 -a) 



ES 154 i 8E 75 
RE 127 1 UR 89 
DE 109 ! ED 1^8 
EH 105 ! Zffi 5^ 



LE 


95! 


EL 


56 


TE 


78J 


ET 


58 


TI 


67! 


IT 


47 



RA 62 | AR 4l 
EM 58 ! MB 52 
LA 57 ! AL 36 



is 52 ] si 36 

LU 49 ] CE 47 
AT 46 ! TA 40 



2-e . Frequent digraphs in French plain text ^hose reversals are rare or 
i nfre q uent, a ccom panied hy their frequencies iron the table of 5,000 digraphs 
flfcen 2-aj . 



NT 99 j TN 4 J QU 54 ; UQ I J ES 53 j SN 6 | OU 52 j UO 1 

“ _ ». J 

2-f . Doublets occurring in French plain text, arran ge d according to their 
frequencies from the table of 5,000 digraphs (item 2 -a) . 



SS 


4l 


LL 


26 


NH 


10 


PP 


9 


cc 


6 


M 


2 [ 


EE 


30 


MM 


.13 


TT 


10 


RR 


7 


FF 


6 


DD 


1 1 



1 

1 



2 -g . The 22 digraphs appear jpg 100 or. more times as beginnings of -words 
in 10, 748 words in French plain teach, arranged according to their- absolute 
frequencies . 



DE 


501 


RE 


283 


PO 


222 


CU 


168 


AU 


150 


DI 


124 


CO 


394 


PA 


268 


IN 


178 


CE 


163 


HO 


133 


AL 


122 


QU 

PR 


347 

291 


LE 


240 


SE 


178 


ET 


153 


TR 


127 


UN 


122 



1 

117 

112 

101 



1 The 13 digraphs above this line compose 25$ of French plain te'ct, 
^ The 39 digraphs above this line compose 50$ of French plain text - . 
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3 -a* The 97 trigraphs appearing 100 or more times In 53,758 letters of 
French plain text, arranged according to their absolute frequencies . 



ENT 588 
ION 555 
TIO 433 
ONS 373 
EES 367 
QUE 338 
DES 313 
EDE 305 
EME 288 
ATI 287 
EES 284 
MCE 281 
TEE 280 
MEN 272 



CON 271 
ERE 267 
ANT 238 
ESE 230 
ELA 227 
LEE 216 
FAR' 213 
NDE 211 
SDE 210 
DEL 209 
FEE 206 
OUR 205 
RAN 196 
IRE 191 



EST 188 
ERA 185 
ECO 184 
ESD 179 
OND 175 
LEM 173 
NCE 173 
ELE 172 
ESA 163 
TDE 163 
FEE 162 
SSE 160 
ONT 157 
ANC 153 



ESS 151 
AFE 147 
POU 146 
TER 146 
COM 143 
ESP 139 

ous 139 

AIS 137 
EMA 137 
3ER 136 
NTS 135 
TES 135 
EQU 133 
IQU 131 



NSE 130 
REN 127 
SQU 124 
AIR 123 
EPA 120 
QUI 120 
SET 120 
REC 119 
AND 118 
ETA 118 
SEN 118 
PRO 117 
ISE 116 
REP 116 



EUR 115 
NTA 115 
SER 115 
ESO 112 
DEC 110 
EPR 110 
ALL 109 
ECE 109 
TJNE 108 
RAI 106 
RLE 106 
SSI 106 
ENE 105 
SUR 105 



TRA 105 
ISS 104 
HIT 103 
TEN 103 
DEL 102 
ANS 101 
BLE 101 
QUA 101 
CES 100 
ETE 100 
ETR 100 
0154 100 
TAT 100 



3-b . The 20 trigraphs appearing 50 or more times as Beginnings of words 
In 10,748 -words in French plain text, arranged according to their absolute 
frequencies . 



CON 213 
POU 144 
PEE 135 



COM 129 
PRO 105 
ALL 104 



FRA 93 
PAR 87 
QUA 80 



INT 75 
CEN 72 
NOU 69 



ETA 69 
DAN 68 
RED 65 



SER 61 
TRA 57 
RES 56 



vou 56 

FAI 50 



4. The 82 tetragraphs appearing 50 or more times In 55.758 letters of 
French plain text, arranged according to their absolute frequencies . 



TION 431 


CONS 


98 


LEME 


83 


ERAL 


71 


EREN 


58 


RESS 


55 


MENT 251 


EPAR 


98 


QUEL 


83 


ERES 


70 


ESSE 


58 


IEKE 


53 


ATIO 220 


KESE 


96 


LEMA 


80 


DANS 


67 


NOUS 


58 


IRES 


53 


IONS 208 


ENTE 


95 


PORT 


80 


OUVE 


67 


TRES 


58 


TEDE 


53 


EMEN 200 


LLEM 


93 


ENTS 


78 


EMAN 


66 


EEER 


57 


EQUE 


52 


POUR 136 


FRAN 


91 


EPRE 


77 


SENT 


66 


NDES 


57 


NDEL 


52 


IQUE 128 


PRES 


91 


EDES 


76 


ANDE 


63 


NSE I 


57 


ECOM 


51 


IOND 124 


ENTA 


90 


ESET 


76 


PART 


62 


NTDE 


57 


GENE 


51 


DELA 120 


RANC 


90 


INTE 


7 6 


SDES 


62 


CAIS 


56 


SEIL 


51 


AIRE 117 


ANCE 


89 


ALLE 


75 


ESEN 


61 


ESTI 


56 


ELES 


50 


ONDE 107 


SION 


89 


ANTE 


75 


RAFT 


61 


IT 10 


55 


ETAT 


50 


ECON 102 


COMM 


88 


MAND 


75 


EHTD 


60 


wm 


55 


ILLE 


50 


ESDE 102 


ELLE 


84 


CENT 


74 


SSIO 


60 


NERA 


55 


SQUE 


50 


ONSE 101 


NTER 


84 


QUES 


72 


ENCE 


59 











5* Average length of vords In French plain text ° 5<2 letters . 
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_ C. _3TALXAIf LETTER FBE 9 .UEIICY DATA 

T / • - — - J « 

In a3 1 calculations , accented letters have 130611 combined with -the 
corresponding unaccented letter . , „„ 

u « . t • i * ir fT * » ft 

1-a . Absolute f^equoaclcs <5f ginglo letters of . Italian plain text, arranged 
alphabetically, based on" 57, 906 _ letters of text . _ 

- * rTTTT* •<.<'£ v» • 



A 


6,771 


G 


l,l68 


L 


3,592 

l,44i 


Q 


227 


V 


1,024 


B 


527 


H 


493 


M 


R 


4,037 


W 


13 


C 


2,367 


I 


6,568 


H * 


4,094’ 


S 


2,967 


X 


9 


D 


2,258 


J 


18 


0 


5,022 


T 


4,139 


Y 


l4 


E 


6,784 


IC 


28 


P 


l,6l6 


U 


1,547 


Z 


527 


F 


655 















57,906 



l-h. Monographic kappa plain, Italian language ° .0745 

l-c . Frequency distrihut ton of single letters based on 57 ,906 letters in 
Italian plain text, reduced to 1,000 letters end arranged according to their 
frequencies 



E 


117' 


R 


70 


P 


28 


F 


11 


K “ 


- 


A 


3J7 


L 


62 


u 


27 


B 


9 


J 


- 


I 


113 


S 


51 


M 


25 


Z 


9 


Y 


- 


0 


87 


C 


4i 


G 


20 


H 


9 


W 


- 


T 


72 


D 


39 


V 


18 


Q 


4 


X 


- 


N 


71 



















1,000 



1-d. Percentage of vowels , high-frequency consonants, medium-frequency con- 
sonants, and low-frequency consonants in 57 >906 letters of Ilalian plain 
text . Percentage of o most frequent letters m Italian plain text . 

Vowels A,E,I,0,U, and Y = 46.1$ 

High-Frequency Consonants L,N,R, and T = 27.4$ 

Me&lum-Fre quenc y Consonants C,D,G,M,P,S, and V ■ 22.2$ 

Low-Frequency Consonants B,F,H, J,K,Q,W,X, and Z = 4.3$ 

(Listed in descending order of frequency) 

8 most frequent letters E,A,I,0,T,N,R,L - 70.8$ 

1-e . Absolute frequencies of single letters as initial letters of 10,48l 
words m Italian plain text, arranged according to their frequencies. (One 
letter words have been omittedP 



D 


1,381 


L 


500 


T 


337 


U 


217 


J 


13 


C 


i,o4i 


R 


403 


G 


333 


Q 


172 


w 


9 


S 


885 


N 


396 


F 


298 


B 


153 


K 


6 


P 


830 


E 


374 


V 


263 


H 


69 


Y 


3 


A 

I 


822 

685 


M 


371 

1 

* 


0 


235 


Z 


29 


X 


2 

15781 
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A 

A 1 18 
B 10 



2 -a. Frequency distribution of digraphs based on 57*8^7 letters of 
Italian* plain text, reduced to "g, 000 digraphs . 



B 


C 


D 


E 


F 


G 


H 


I 


J 


K 


24 Letter 

L M N 


0 


p 


Q 


R 


S 


T 


U 


V. 


w 


X 


y 


z 


1 


31 


41 


14 


12 


22 


1 


H 






76 


24 


7 4. 


s 


24 


* 

4 


SI 


2d 


d* 


d 


»* 

24 








a 


7 






7 








10 






1 




% 








4 






’2 










r 








>«LU - 



A*'-* . - jf* 3g|aw»»^^*; < 

■» - _n ^ "*r *«*■ Wh- J. r* l 

. - "* ' - -^4***$, *. » 

• - — ft. ui^issrsvaw »*'• J •- 



- r* ■ - 

'-Vj lyjTi^V ' 



j si* r * > t « 

2~b. Digraphic kappa plain, Italian language »" it) wl 

. ^ - *'■***■•#*. n-tV. 

2-c. The 89 digraphs comprising 75$ of Italian plain text, baaed on the 
table of 5,000 digraphs . (Item 2-a), arranged according to their relative 



frequencies 



ER 


96 


RI 


63 


LL 


52 


ON 


86 


IA 


63 


IC 


51 


TA 


78 


LA 


62 


HE 


50 


AH 


78 


IH 


62 


HO 


50 


AL 


76 


1,260! 


Lfi 


49 


EH 


73 


RA 


62 


IS 


48 


RE 


71 


ES 


6l 


IT 


45 


HT 


66 


TI 


59 


OL 


45 


DE 


65 


ST 


58 


RO 


45 


TE 


65 


AR 


57 


SI 


44 


EL 


65 


TO 


56 


10 


43 


DI 


64 


LI 


56 


TR 


43 


CO 


64 


OR 


55 


OS 


40 


AT 


63 


ED 


52 


AD 


39 



S3 
DA 
37 1 EC 



31 AM 
30 UK 



25 VE 
25 0C 



\ r 

.%»/* *> * 






ose reversals are also 



m the table of 5,000 digraphs 



. ^ 



ER 96 ‘ KB 71 el 66 ; IE 

OH 86 1 NO 50 DE 65 ' ED 

TA §3 AT 63 RA 64 j AR 

AH 78 } HA 32 IH 62 i HI 

AL 76 j LA 62 ES 62 J SE 



i AR ;57 
i HI 37. 



*!+• * 

ar 4 ^ 



?6 | IL 
53 OR 55 J RO 
57 ic 52 « cl 
3 % is ,48} si 
32 AD 41 ! DA 



EH 73 j he 51 1 TI 59 



IC 52 
IS ,48 



Frequent digraphs In Italian 



Infre quent, accompanied 



! IT 45 | AC 39 J CA 32 

'■ “7 

plain text ™p$e reversals are rare -ox 



eg frpxa the stable of 5»000 digraphs 



Item 2-& 




» . 1 


i . n j 




ft * '1 

«* *v '? 

1 - • 


i ™ 


1 

9 - 


[ ST $8 j 


1 TS " l'j 


CH 33! 



2-f . D 

frequencies from the table of 5,000 digraphs (.Item 2-aJ« 



LL 


52 


AA 


18 


II 


11 


m 


10 


3 . -XT' — -f 

FF .7 


MM 


4 


w 


2 


TT 


37 


EE 


1? 


PP 


“11 


GO 


8 


ZZ ’ “5 


00 


4 


DD 


1 


SS 


31 


RR 


12 


CC 


10 


BB 


7 




* 


" 







2-g. The 26 digraphs appearing 100 or pore times as beginnings of words in 
10,48l vords in Italian plain text, arranged according to their absolute 



m sms sm 



CO 543 PE 210 PR 18 k HO 121 f MA *$l2 RE 108 

DE 505 CH 197 QC 172’ PA 153 SO. JL2l UH 111 ES 107 

ST 222 AL 186 HE 169 PO l4l TR 121 SU 109 TE 103 

PI 235 HI 185 RI 162 CA 132 DA 120 

1 The 18 digraphs above this line" comprise' 25^ of Italian plain text. 

2 The 43 digraphs above this line coSrprlse of Italian plain text. 

* -,-*5-13 ■ 

**" ^ r Jr V _3jr 4 ¥ .4^^* 



U I’’- 










REF ID:A568,92 



3 - 1 ). 



an plain 


text, 


, arr 


anged according to 


1 tneir absolute frequencies* 


DEL 


3 j (8 


STA 


215 


ERE 


I69 


ICA 


345 


SSI 


130 


ODI 


ll4 


ESI 


107 


EOT 


348 


ALT 


213 


ZIO 


1 66 


RAH 


145 


IJEL 


D27 


ORI 


114 


COR 


IOC 


ELL 


314 


EDI 


212 


ATO 


165 


STR 


145 


ACO 


125 


RMA 


114 


IAN 


10 6 


CON 


306 


ALL 


201 


NTI 


165 


ALE 


144 


ATI 


125 


A15E 


113 


TAN 


105 


CHE 


27 6 


ITA 


198 


ANT 


163 


ID I 


143 


IDE 


123 


BIT 


113 


ATE 


104 


LIA 


274 


ANO 


197 


ERA 


163 


COM 


139 


ADI 


121 


ODE 


113 


NON 


103 


ION 


265 


OST 


196 


TRA 


160 


ECO 


137 


AND 


121 


PRE 


112 


VER 


103 


ONE 


247 


ERT 


187 


ESS 


158 


LLE 


137 


TEN 


120 


NDO 


110 


ICA 


101 


PER 


238 


ARE 


186 


ATT 


157 


OOT 


136 


0N0 


119 


ONI 


no 


OLA 


101 


EDE 


228 


TAL 


184 


OTO 


156 


TER 


136 


ARI 


117 


AZI 


109 


STI 


101 


OTE 


227 


LIA 


180 


ADE 


155 


TAT 


134 


NTR 


117 


ENE 


109 


OCO 


100 


ICO 


216 


1ST 


174 


EST 


151 


TTA 


132 


PAR 


ll6 


ELA 


107 


RIA 


100 


MEN 


216 


GLI 


171 


RES 


146 


ATA 


130 


TRO 


116 


ERO 


107 






The 


19 trigraphs 


appearing 


; 50 or more tinics 


as beginnings 


of words 



frequencies , 



DEL 


217 


STA 


106 


QUA 


81 


PRE 


62 I 


DAL 


57 


PER 


55 


GRA 


53 


CON 


195 


ALL 


100 


PRO 


75 


NHL 


571 


ANC 


56 


RUS 


55 


STO 


51 


COM 


137 


ITA 


94 


QUE 


74 



















4. The 57 tetragraphs appearing 50 or more times in 57.906 letters of 
Italian plain text, arranged according to their absolute frequencies . 



DELL 209 


ALIA 


99 


ICON 


74 


AGLI 


66 


LIAN 


59 


MEOT 188 


COOT 


93 


VANO 


74 


ICHE 


66 


TORI 


59 


I0HE 160 


ADEL 


92 


ECON 


73 


IDEL 


64 


ALEE 


58 


ELLA 150 


OSTR 


88 


IONI 


71 


ELLE 


63 


ANDO 


58 


ZION 147 


EOTO 


87 


STAT 


70 


NELL 


63 


DALL 


58 


TALI 125 


AMEN 


83 


STRA 


70 


IMEN 


61 


NTRO 


58 


AZIO 106 


ALLA 


Ql 


GLIA 


69 


ANTI 


60 


OCHE 


58 


EDEL 106 


ENZA 


75 


ISTA 


68 


ATTA 


60 


ANTE 


57 


ITAL 106 


OOTR 


75 


ODEL 


68 


PART 


60 


EPER 


57 


ENTE 105 


EOTI 


74 


AGON 


66 











OFER 

RUSS 

TATO 

TEDE 

0C0N 

SION 

TAUT 

STOP 

NOST 



56 
5 6 
55 
55 
54 
53 
53 
52 
51 



5 * Average length of words in Italian plain text =5*5 letters. 



l 



*• 



c 



kA- 



x 



5-14 



* i *i 



*fc 




- 



* 



REF ID : A 56892 



D. SPANISH LETTER FREQUENCY MtA 1 



l" 0 '* Absolute frcqucnc ics of single letters of Spanish plain text, arranged 
alphabetically, -based on 60,115 let bers of text , * ~ , 

1 r ^ 

V *602 

w 2 36 

x 127 

Y 413 

z 182 

- 

, So; 115 



A 


6,68l 


G 


823 


L 


2,174 


Q 


> 346 


B 


199 


H 


367 


M 1 


l,7‘-*0 


R 


4,628 


C 


3,137 


I 


4,920 


IT 


4,823 


S 


4,l4o 


D 


2,687 


J 


190 


0 


" 5,859 


T 


3,180 


E 

F 


7,801 

48l 


K 


22 


P 

* 


.AJ85 


U 

* 


2,172 



— . I* 

. <• 1 t. ' » "? + ( * / 

1-b. Monographic kappa plain, Spanish language" - .0747 

1 , ~ * t - } 

" ’ \ ^ >- !-■« IT# * P " 

1-C . Frequency distribution of single letters ‘based on 60,115 letters In 
Spanish plain text , reduced to 1 , 000 letters , and arranged according fro 
their frequencies . _ _ 



E 

A 

0 

I 

N 

R 



130 

111 

,97" 

82 

80 

77 



s 


69 


u 


36 


r a 

V 


. 10 


J 


J ~ 3 




T 


53 


p 


'Jo 


F 


' 8 


Z 


3 




C 


52 


M 


. 29 


Y 


7 


X 


2 




D 


45 


G 


l 4 


H 


6 


w 


1 




L “ 


36 


B 

1 


13 


Q 


6 


K 


1,000 





1-d. Percentage of vowels, high-frequency consonants, medium-frequency 
consonants , and, low-frequency consonants in 60 , 115 letters pf , Spanish 
plain text. Percentage of 8 most frequent letters m Spanish plain text . 

* £ rr t . t 

Vowels A,E,I,0,U, and Y = 46.3# 

High-Frequency Consonants I\f,R, and S = 22.6# 

Medium-Frequency Consonants C,D,L,M,P, and T = 24.5# 

Low-Frequency Consonants B,F,G,H,J,K,Q,V,W,X, and Z ■ 6 . 6 # 

(in descending order of frequency) 

l v 

8 most frequent letters, E,A,0,I,N,R,S, and T - 69 . 9 # 



* - 



1-e . Absolute frequencies of single letters as initial letters of 10,129 
words in Spanish plain text, arranged according to their frequencies. (One - 
letter words have heen omitted?* ! 



p 


1,128 


L 


435 


Q 


286 


V 


183 


Y 


27 


c 


l,08l 


R 


425 


I 


281 


F 


177 


W 


19 


D 


1,012 


M 


403 


H 


230 


0 


169 


Z 


2 


E 


98 9 


N 


’ 346 


U 


219 


B 


124 


K 


1 


S 


789 


T 


298 


G 


20 6 


J 


47 


X 


— 


A 


761 



















10,129 



Includes N throughout all tables . 

p 

From foreign words appearing in Spanish plain text. 
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REF ID : A56892 



2-b . Digraphic kappa plain, Spanish language e '.0091 



2-c . The 


8? digraphs 


comprising of Spanish plain 


text 


, based on the 


table of -5 


,000 digraphs, (Item 2-a) 


, arrange a 


according to 


their relative 


frequencies . 






~ i 






\ 


* 


t . _ 




l 




Eir 


126 


TE 


67 


IN 50 


NA 


4l 


MA 


32 


IS 


27 


EA 


20 


rs 


119 


/If 




EC 47 


n 


40 o 


SA 


32 


EH 


26 


OA 


19 


ON 


104 


1,26 r 


PI 45 


2/513 


PO 


31 


SP 


26 


FU 


19 


ER 


94 


AD 


a 


EL 44 


CA 


39 


in 


30 


ED 


26 


SC 


18 


RE 


94 


AS 


62 


LA 44 


KD 


37 


PA 


30 


OD 


26 


AT 


18 


NT 


91 


TA 


60 


RO 43 


TI '35 


AD 


30 


AP 


24 


CU 


18 


DE 


84 


DO 


59 


NO 43 


LE 


35 


DI 


30 


IT 


24 


EE 


17 


AP 


8] 


OR 


58 


IA 43 


TR 


34 


ID 


29 


EP 


23 


OB 


17 


Cl 


80 


SE 


5r 


IC 42 


UN 


34 


CU 


29 


SU 


23 


CE 


17 


RA 


74 


CT 


57 


ME 42 


PN 


34 


OP 


29 


SO 


22 


ET 


17 


03 


73 


TO 


5 c 


AL 4l 


OM 


33 


LI 


28 


OL 


22 


DO 


17 


CO 


69 


AC 


5 f + 


SI 4i 


NC 


33 


HI 


28 


1.3 


21 






10 


67 


UE '52 

t 


EE hi 


DA 


32 


OC 


28 


EG 


21 


3,753 


2-d . Frequent 


digraphs In Spanish plain tot vhose 


reversals are also 


frequent , 


accompanied by their 


frequencies 


from the 


table 


of 5»0CC digraphs 


(Item 2-a) 


























“ * 




- -* •* 


^ * 


- r 


r- 


* 






l 








EN 126 


! NE 


4i 


AN 


8l i RA 74 


'as 


62 


! SA 32 


LA 44 


J AL ~4l 


ES 119 


i SE 


57 


Cl 


80 1 IC 


42 


OR 


58 


i RO 43 


EL 44 • LE 35 


ON 104 


; NO 


43 


AN 64 HA 


4l 


AC 


54 


! CA 39 


MA 32 l AM 30 


ER 94 


| RE 


94 


AD 64 ! DA 32 














- 


2-e. Frequent digraphs* i4 Spanish' plain text 


vhose 


reversals are rare or 


Infrequent 


, accompanied by -their frequencies from the 


table of 5,000 


digraphs (Item 


2-a/ 


• 




1 






» 








K. 


NT 91 


' TN 0 


ST 


57 • TS 


°J 


ND 37 


! DN 1 


EC 


33 | CN 0 


10 67 


! 61 


4- 




> » 

* T 


, ; * 










— 




2-f . Doublets 


occurring in Spanish plain text, arra np 


led according to their 


frequencies from the table of 


5,000 


digraphs (Item 2-a). 




EE If | AA 12 


RR 10 | SS 


10 | LL 9 | CC 


>1 


00 4 


HN 3 | 


DD 2 


2-g . The 


21 digraphs 


appearing 100 or more times as beginnings of words 



In 30,] 29 words in Spanish, plain text, 'arranged according to their absolute 
frequencies 



CO 684 
HE 335 
DE 323 



PR 307 
ES 286 
qu 286 



PA 263 
PO 247 
IN 235 



SE 189 
DI 175 
FU 157 



CA 151 
SI 137 

MI 117 



PE 111 
UN 109 
HA 108 



MA 101 
CU 100 
SO 100 



1 , 

The 15 dl graphs above this line comprise 25p of Spanish plain bext f 

^ The 40 digraphs above this line comprise 5<yjo of Spanish plain text. 
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3 -a. The 105 trlgraphs appearing 100 or more times in 60,115 letters of 
Spanish plain text, arranged according to their absolute frequencies . 



ENT 59^ 


ARA 229 


FOR 176 


OSE 147 


ERO 131 


NDE 121 


PER 111 


ION 564 


ONE 227 


TER 174 


ONS 144 


ONT 131 


RAN 121 


ASE 109 


CIO 502 


ESE 217 


ODE 168 


RCC 344 


ANA 130 


STB 119 


CAN 109 


NTE 429 


ADE 202 


ERE 366 


ORE 143 


ARE 130 


REIT 118 


UNI 108 


CON 415 


PAR 193 


ERA 165 


0C0 142 


UHT 129 


ARI 117 


osi 107 


EST 355 


CIA 190 


TRA 165 


DDE l4l 


MO 127 


TEN ll 6 


GEN 105 


EES 335 


ENC 190 


AME 163 


ICI l4o 


TAR 127 


01 © 115 


NCO 105 


ADO 30 7 


NCI 188 


ERI 162 


END 139 


ANT 126 


RIA 115 


RIO 105 


QUE 294 


PRE 184 


MER 159 


SEN 139 


ESA 126 


EC I ll4 


ERN 104 


AC I 277 


DEL 183 


ELA 158 


TAD 138 


IER 126 


1ST 113 


QMOC 104 


NTO 270 


NDO 183 


PRO 155 


ECO 135 


ADA 125 


ONA 113 


SCO 104 


ms 267 


NES 183 


ACO 153 


STR 134 


DEN 124 


DAD 112 


TES 103 


COM 246 


DOS 182 


ENE 151 


TOS 133 


AND 123 


INT 112 


BIE 101 


ICA 242 


MEN 181 


X3ES 150 


IDA 132 


DES 121 


NTR 112 


NTI 100 


STA 240 


uta 176 


ESP 149 


SDE 132 


IDO 121 


ESI 111 


TOR 100 


3-h. The 19 trigraphs 


appearing 


: 50 or more times 


as beginnings of words 


in 10,129 words in Spanish plain text, arranged according to their 


absolute frequencies. 












CON 298 


PAR 154 


PUN 93 


INT 72 


1 

UNI 55 


CUA 52 


REP 51 


COM 218 


PRO 139 


PER 80 


RES 72 


DES 53 


TRA 52 


AEG 50 


EST 194 


PRE 114 


GOB 77 


NUE 66 


INF 53 







4, The 86 tetragraphs appearing 50 or more times In 60,115 letters of 
Spanish plain teacb, arranged according to their absolute frequencies . 



CION 


4)1-4 


CONS 


104 


ERUO 


79 


AMER 


72 


FORM 


62 


AGIO 


252 


CONT 


99 


IERN 


78 


ZEND 


72 


SENT 


62 


ENTE 


233 


PUNT 


95 


OQUE 


78 


IDAD 


71 


ICIO 


6 l 


ESTA 


174 


ANDO 


91 


IONA 


77 


EHDO 


70 


ONTR 


60 


TONE 


159 


TADO 


91 


XJEST 


77 


ERIC 


70 


SION 


60 


MEET 


150 


AGON 


90 


BIER 


76 


NTOS 


70 


CCIO 


59 


ONES 


146 


ANTE 


89 


ICAN 


76 


MIEN 


69 


GENT' 


58 


IENT 


l4l 


KTER 


85 


RESE 


76 


IOND 


67 


COMA 


57 


EIITO 


137 


INTE 


84 


GOBI 


75 


MERI 


67 


ESDE 


57 


ENC I 


128 


NTES 


82 


OBIE 


75 


NTRA 


67 


ORES 


57 


PARA 


117 


ADOS 


81 


ECON 


74 


DELA 


65 


RECI 


57 


ENTA 


115 


AMEN 


81 


RGEN 


73 


ENTI 


64 


AQUE 


56 


NCI A 


115 


OCON 


81 


RICA 


73 


NTHT 


64 


IOIJP 


56 


PRES 


111 


ESEN 


80 


STAD 


73 


COMI 


63 


QUES 


58 


UNTO 


111 


ONDE 


80 















EEST 

SCON 

SIDE 

CIEN 

NF0R 

OPOR 

KESP 

ARI0 

ESTR 

ARGE 

ECTO 

PART 

FOSI 

EPEE 



55 

55 

55 

54 

54 

54 

54 

53 

53 

51 

51 

51 

51 

50 



5. Average length of words in Spanish plain text 



5.9 letters 



5-18 
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E. 



PORTUGUESE TETTER FREQUENCY DATA 




1-a. Absolute frequencies of single letters, of Portuguese pi aim text, arranged 
oTphabeticaily, ^'ascd on^i^^ 3 etters of text . 



A 


5*3^2 


G 


724 


L 


1,245 


Q 


348 


V 


737 


B 


470 


H 


304 


M 


1,699 


R 


3,292 


w 


24 


C 


2,285 


I 


3,31 J » 


N 


2,912 


s 


3,409 


X 


166 


D 


1,900 


J 


160 


0 


5,ooi 


T 


2,679 


Y 


22 


E 


,5,441 


IC 


17 


P 


1,3 n 


u 


1,491 


Z 


207 


F 


520 






- 













457lo& 



1-b. Monographic kappa plain, Portuguese language = .0746 

1-c . Frequency distribution of single letters based on 45,306 letters of 
Portuguese plain text, reduced to 1,000 letters . 



E 


121 


R 


65 


U 


33 


F 


ll" 


X 


4 


A 


119 


T 


59 


P 


30 


B 


10 


J 


3 


0 


111 


C 


51 


L 


28 


Q 


8 


w 


l 


S 


76 


D 


42 


V 


16 


H 


7 


Y 


- 


I 


73 


M 


38 


G 


16 


Z 


5 


K 


mm 


R 


73 












- 


-M 





1,000 



1-d. Percentage of vowels, high-frequency consonants, medium-frequency con - 
sonants, and lov i^equency^ consonants in 4$, 106 letters of Portuguese plain 
text. Percentage of 8 most fioQuent letters in Portuguese plain text . 

Vowels A,E,I,0,U, and Y = 45.8$ 

High-Fre quedey Consonants N,R, and S * 21.3$ 

Medium-Frequency Consonants C,D,L,M,P, and T » 24.8$ 

Low-Frequency Consonants B,F,G,H,J,K,Q,V,W,X,Y, and Z « 8.1$ 



(in descending order of frequency) 

8 most frequent letters E,A,0,S,I,R,N, and T - 69 . 7 $ 

__ r 

_ / 

1-e. Absolute frequencies of single letters as Initial letters of 7»058 words 



in Portuguese plain text, arranged according to their frequencies . 

14 r « - 1 

11 

7 +’-■?. s 

4 . * ; 



P 


847 


M 


405 


I 


264 


B 


313 


C 


731 


T 


343 


F 


222 


G 


111 


E 


608 


R 


316 


Q 


222 


J 


92 


S 


601 


IT 


299 


0 


187 


U 


77 


A 

D 


597 

506 


V 


271 


L 


143 


H 


60 



Z 

W 

K 

.Y 

X 



- v* , 



T7s& 
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2-1). Dlgraphic kappa plain, Portuguese language * .0084 

i 

2-c . The 93. digraphs compi-lslng 75$ of Portuguese plain text, based on the 
table of ^,000 digraphs ' (Item 2 -a), affixed ^ according to their relative ' 
frequencies . ~ 



NT 


ll4 


TA 


85 


ST 


47 


AM 


.38 


CE 


30 


OD 


25 


AT 


22 


EN 


97 


1, 


.224 1 


HI 


46 




,505 a 


HC 


29 


NO 


25 


UA 


22 


ES 


95 


SE 


62 


DA 


45 


HD 


35 


ER 


28 


LA 


24 


OA 


21 


TO 


88 


DO 


6l 


EM 


44 


OP 


35 


IT 


27 


LE 


24 


LI 


21 


EE 


86 


DE 


6l 


IA 


42 


SP 


35 


OE 


27 


AP 


23 


OL 


20 


CO 


85 


AD 


60 


HA 


4i 


RO 


34 


El 


27 


EG 


23 


ET 


20 


OS 


85 


PO 


60 


SA 


4i 


IC 


34 


UE 


26 


VI 


23 


01 


20 


OH 


79 


CA 


60 


SS 


40 


TR 


33 


MI 


26 


SO 


23 


NS 


19 


ER 


76 


AH 


56 


Cl 


39 


DI 


33 


10 


26 


SI 


23 


SU 


18 


- RA 


75 


IN 


53 


IS 


39 


OC 


32 


PA 


26 


OV 


22 


ET 


18 


AS 


72 


AC 


52 


AL 


38 


EL 


31 


TI 


26 


SC 


22 


EP 


18 


OR 


71 


ME 


51 


VE 


37 


ID 


31 


PE 


25 


Hi 


22 


UI 


18 


TE 


69 


AO 


49 


QU 


37 


HA 


31 


IR 


25 


ED 


22 


3j 


,755 


AR 


68 


EC 


48 


OM 


36 










• 









2-d . Frequent digraphs In Portuguese plain text vhoae reversals are also 
frequent, accompanied by thoir frequencies from the table of 5,000 digraphs 
Xltem'2-aT. 



ES 


95' 


SE 


62 


OR 


71! 


RO 


34 


ME 


51! 


EM 


44 


RE 


86 


ER 


76 


CA 


60 } 


AC 


52 


EC 


48 


CE 


30 


CO 


85 1 


OC 


'32 


AD 


60 j 


DA 


45 


MA 


41 1 


AM 


36 


RA 


75 


AR 


68 


PO 


60! 


OP 


35 


Cl 


39 


IC 


34 


AS 


72, 


SA 


4l 


Air 


56! 


HA 


31 


DI 


33 f 


ID 


31 



2-e . Frequent digraphs in Portuguese plain text whose reveraals are rare 
or Infrequent, accompanied by their frequencies from the table of $,000 
digraphs' (I tea 2 -a)" . 

HT 114 S TH 1 1 ST 47 j TS 0 | HD 35 i DN 0 

2-f . Doublets occurring in Portuguese plain text, arranged according to 
their frequencies from th'e''t‘a : ble of ^ > 00Q digraphs (Item 2-a) . • 



SS 


40 


EE 


11 


00 


5 


LL " 


2 


AA 


11 


RR 


11 


CC 


2 


MM 


2 



1 j PP 1 1 TT X 



2-g. The 20 digraphs appearing 100 or more times as beginnings of vords In 
6,803 words in Portuguese plain text, arranged according to their absolute 
frequencies . 



CO 464 
PO 386 
se 333 



EE 276 
DE 259 
QU 220 



IK 188 
ES 173 
PR 169 



PA 143 
HA 133 
TE 132 



HA 130 
EE 122 
VE 115 



ME 111 
MI 105 
HO 104 



TR 103 
DI 102 



1 The 15 digraphs above this line compose 25$ of Portuguese plain text. 

2 The 42 digraphs above this line compose 50$ of Portuguese plain text. 
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3-a. The 59 trigraphs appearing 100 or more times In 45,106 letters of 
Portuguese plain text, arranged according to their absolute frequencies . 



ENT 


474 


TOS 


191 


ERE 


150 


IDA 


133 


OSE 


126 


ECE 


115 


ASE 


105 


NTO 


457 


EST 


186 


CIA 


145 


TER 


132 


ARE 


125 


NCI 


n4 


ITO 


104 


OUT 


303 


ACA 


182 


ADE 


143 


0P0 


130 


ESE 


124 


REC 


113 


ELE 


103 


HTE 


284 


RES 


181 


8 TA 


143 


SPO 


130 


OVE 


124 


PAR 


112 


ERI 


103 


CON 


255 


QUE 


172 


ICA 


142 


ADA 


129 


SSA 


124 


ESS 


110 


PRO 


102 


PON 


236 


NTA 


167 


0C0 


140 


TRA 


129 


DES 


123 


DAD 


109 


AME 


101 


CAQ 


227 


POR 


159 


ARA 


136 


HDO 


127 


ECO 


121 


ORE 


108 


OSS 


101 


ADO 


211 


AGO 


158 


DOS 


134 


ENC 


126 


ODE 


118 


EDI 


107 


IME 


100 


MEN 


205 


COM 


154 


OES 


134 



















3-t>. The 19 trigraphs appearing 50 or more times as beginnings of words 
In 6,497 words in Portuguese plain text, arranged according to their 
absolute frequencies . ” ~ ’ 



CON 224 
PON 213 
COM 136 



QUE 


109 


PRO 


93 


QUA 


83 


TRA 


66 


VEX 


531 


RES 


52 


EST 


105 


POR 


88 


DES 


71 


MIL 


61 


IND 


52 | 


REC 


51 


PAR 


93 


NAO 


86 


SER 


70 


REF 


56 











4. The 38 tetragraphs appearing 50 or more times In 45 >10 6 letters of 
Portuguese plain text, arranged according to their absolute frequencies * 



ONTO 233 


ENTA 


97 


AMEN 


81 


CONT 


68 


CONS 


58 


PONT 221 


NCIA 


95 


PARA 


81 


FORM 


67 


NTES 


58 


MENT 183 


PORT 


87 


COES 


73 


OCON 


66 


ANDO 


57 


ElffiO 173 


DALE 


86 


2DAD 


71 


ELEG 


61 


ANTE 


57 


EHTE 147 


ESTA 


85 


CENT 


70 


ADOS 


60 


ORMA 


54 


ACAO 142 


ENCI 


83 


INTE 


70 


IMEN 


60 


VEXA 


54 


NTOS l4l 


SPON 


83 















BENT 

TELE 

EGRA 

NFOR 

OPON 

LEGR 



52 

52 

51 

51 

51 

50 



5. Average length of voids in Portuguese plain text = 6.48 
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F, RUSSIAN LETTER FREQUENCY DATA 



1-a. Absolute frequencies of single letters of Russian plain text, arranged 

— 



A 


5,122 


3 


1,?80 


H 


U,lt63 


y 


1,578 


•A 


257 


E 


1,0 95 


M 


ii,923 


O 


8,078 


+ 


127 


bl 


1,1)21 


B 


3,51*3 


w 

14 


961 


n 


1,01b' 


X 


9U1 


b 


960 


r 


l,liil 


K 


2,32k 


P 


3,k?7 


u, 


36? 


* 


173 


A 


2,076 


J\ 


2,7k7 


c 


3,917 


4 


902 


»o 


1*55 


E 


5,537 
5 02 


M 


1,936 


T 


h,0hl 


Ui 


55k 


$1 


1,185 

S?^5?3 



l-b. Monographic kappa plain, Russian language a *0568 



1-c. Frequency d istribution o f sin gle loiters based on 67,850 letters of 
Russian p’lain text, F^uced ib~T^T)60 letter s , arrangecT^c^i^ing 'to Itieir 
relative ffoqucncies . 



o 


119 


B 


52 


n 


27 


E 


16 


* 


7 


E 


, 82 


P 


50 


y 


23 


ft 


Hi 


K> 


7 


A 


75 


J\ 


I|0 


bi 


21 


b 


Hi 


u, 


5 


14 


73 


K 


3k 


3 


19 


X 


m 


Ui, 


U 


H 


66 


A 


31 


<a 


17 


4 


13 


> 


3 


T 


60 


M 


29 


r 


17 


uu 


8 


* 


2 


C 


58 


























-- j 


i - 




- 


- 


- T* 



1-d, Percentage of vowols a high-fr e quency c on3o nants 5 med i um- fr equency con - 
sonants 5 3n3~To^ con s onan bs in^fT^^^ t ors of~fiussTan plaiinEext , 

l^rccntage of'TRT mosFT^equent’TSITar s In Russian plain iexEi 7TT - 

-in mir l l . j uni n riLTm - n Minm nrT*. i 1 i 1 1 -| n n n ri - - i - i ^ * 1 ,,L " ' " 4 * 

Vowels A, E , H , Pi , O, y, bl, * ,K> , and 9 a 1*3*1$ 

High-Frequency Consonants B, H, P , c, and T a 28,6# 

Medium-Frequency Consonants E,r, l o l ,^»K,.n,M>n,X,M^ and b n 25,1$ 
Low-Frequency Consonants 4* , u, , ui , and ui, a 2 .6# 

% *• -* •* ■» -™ • • *■ *“ f * ' 

10 most frequent letters (in descending order of frequency) C, E, A, 

14, H,T, C, B, P, andJI a 67.5# 



1-e, Absolute freque ncies of single letters as initial letters of 10,601 
words ^Russian p' lairT toxfc, arrangeef according^Q m Their~fi^ii^nc : ^s , 

'( One-letter words nave' been ori&TteaTy 



n 


1,210 


A 


c 


983 


M 


H 


800 


P 


B 


731 


T 


O 


650 


3 


X 


555 


E 



U96 


14 


321 


W 16 


r 


292 


li29 


y 


229 


iil8 


H 


182 


ItOii 


> 


lltf 


3UU 




II 46 



5-23 



120 


4> 


58 


116 




k7 


92 


S 


111 


72 


to 


3k 


63 


•A 


2 






1(7,1.731 
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2*-a„ P^cquoncy dis tribution of dlcraphs based on 67»Q£0 loiters of 
Russian ' tcH^rccI need to £ T ouD diR Toph s , 

1 4 LeHer 

; s r A s |3 k ji m Hon pct y <}» x_ u, h ua w, m b . $ 10 si 

2 T 3sl s [ 1 1 FTTg rT^ I 7l 7119m 19 r> s 11 vii n 27 3 1 10 (I 7 10 1 2 c o 

G 2 21 8 1 G 3 U 2 



163 



i MBB BIilBfflW 
lilEEBBiEBl 



BBBBBBEB 

IIBEBBBBBBEBBBBEBBIBii 
IIBBBBBBEBBBBBBIBIBH 

sbsbbbI 
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2-b. Digraphic kappa plain, Russian language s .005? 

2-c . The 159 digraphs comprising 75$ of Russ lap plain text) baaed on the 
table of 5 >000 digraphs (Xtcm'2 -a)' , a^anged according to" their" relative 
frequencies . ~ 



0B 


84 


UP 


39 


J10 


30 


EM 


24 1 


AM 


19 


AH 


14 


HP 


11 


BH 


10 


CT 


82 


OM 


38 


Jib 


30 


PM 


24 j 


OK 


19 


DM 


14 


CC 


11 


Ain 


10 


HO 


67 


PE 


37, 


HE 


29 


mi 


24 ! 


TC 


18 


yc 


14 


HE 


11 


JIH 


9 


K0 


66 


IZ50T 


MT 


29 


M3 


23 | 


Bbl 


18 


bH 


13 


An 


11 


PT 


9 


EH 


63 


EC 


37 


Oft 


29 


r TE 


23j 


03 


18 


CJI 


13 


m 


11 


HB 


9 


HH 


58 


JIM 


36 


MC 


29 




MA 


18 


HH 


13 


TH 


11 


BH 


9 


B0 


58 


HH 


36 


TH 


28 


MM 


23 


XO 


18 


HM 


13 


Mft 


11 


HC 


9 


PO 


56 


BA 


35 


OB 


28 


HO 


22 


on 


18 


EK 


13 


yn 


11 


BE 


9 


TO 


56 


3A 


35 


AT 


27 


MB 


22 


EB 


18 


m 


13 


sr 


11 


3H 


9 


PA 


55 


EJI 


35 


TB 


27 


MJI 


21 


CJI 


37 


HM 


13 


HH 


10 


EB 


9 


HA 


54 


AB 


35 


EH 


27 


Tb 


21 


MX 


17 


OM 


12 


E3 


10 


m 


9 


ro 


50 


TA 


35 


AJI 


27 


ME 


21 


m 


17 


>KE 


12 


ftc 


10 


HB 


9 


oc 


50 


HE 


34 


CO 


27 


ME 


21 


BM 


17 


AB 


32 


HH 


10 


HT 


9 


AH 


48 


ET 


33 


KM 


26 


BO 


21 


py 


16 


HA 


12 


AX 


10 


OT 


9 


OH 


4-7 


JIE 


33 


AP 


26 


MM 


20 


En 


16 


HE 


12 


HM 


10 


AH 


9 


no 


46 


or 


32 


TP 


26 


BC 


19 


LJX 


16 


OH 


12 


as 


10 


CH 


9 


OP 


43 


BJ5 


32 


HA 


25 


m 


19 


ns 


15 


TH 


11 


i<y 


10 


yio 


9 


np 


41 


m 


32 


CE 


25 


MO 


19 


A3 


15 


cn 


11 


h y 


10 


30 


9 


OJI 


41 


TE 


31 


JIA 


25 


AK 


19 


Eft 


15 


bC 


11 


KP 


10 


00 9 

3Y5o 


CK 


40 


AC 


31 


KA 


24 


m 


19 


OE 


15 


BH 


11 


Ty 


10 


OT 


39 


OH 


30 

























2-d . Frequent digraphs In Russian plain text whose loverGals arc al so 
frequent, accompanied by their fxcquencicg from the table of 5 >000 digraphs 
Tltem 2-a) 



OB 


84 


; BO 


58 


PO 


56 


OP 


43 


ro 


50 | 


or 


32 


BA 


35 


AB 


35 


HO 


67 


[ OH 


30 


TO 


56 


OT 


39 


OJI 


41 ; 


JIO 


30 


EJI 


35 


JIE 


33 


EH 


63 


HE 


34 


HA 


54 


AH 


45 


EP 


39 i 


PE 


37 


ET 


33 


TE 


31 


HM 


58 


MH 


32 





















2-e . Frequent digraphs in Russian plain text whose reversals are rare 
or infrequent, accompanied by their frequencies from the table of 5,000 
digraphs (ltem2-a)' . 

IIP 41 I pn 2 j CK 40 i KC 3 

2-f . Doublets occurring in Russian plain text, arranged according to their 
frequencies from the' table of 5,000 digraphs (Item 2-a) . 



HH 


24 


CC 


11 


EE 


7 


MM 


3 


| JIJI 


2 j 


HH 


JL 


MH 


19 


00 


9 


BB 


5 


AA 


2 


! TT 


2 | 


KK 


1 



1 The 24 digraphs above this line compose 25$ of Russian p'ain text. 

2 The 66 digraphs above this line compose 50$ of Russian plain text. 
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i i ■» 

2-g , The 24 digraphs appearing 100 or sore tlrses as beginnings of words in 
10,601 words in Russian plain text, arranged, according to their absolute 
frequencies . * ' 1 



np 


470 


PA 


250 


ro 


I 69 


OE 


146 


HO 


120 


KA 


110 


ME 


107 


no 


405 


HA 


246 


CE 


167 


HE 


137 


or 


115 


HE 


no 


BC 


101 


3A 


292 


CO 


220 


CT 


l 6 l 


HE 


122 


3T 


111 


TO 


108 


MA 


101 


KO 


287 


BO 


179 


BH 


159 



















3-a. The 69 trigraphs appearing 100 or more tlre3 in 67 ,8?0 letters of 
Russian plaintext, arranged according to their absolute frequencies 



oro 318 
EHM 2Q5 
CKO 270 
CTB 267 
OCT 260 
npo 233 
CTA 217 
0B0 264 
BOH 203 
EHH 198 



TEJI 188 
HOB 181 
EJ1L 176 
OBA I 69 
opo 167 

CTP I 65 
ECT 159 
AHM 158 
CKH 158 
TOB 158 



TOP 152 
JTEH 151 
noji 149 
JEEH 146 
HHX 145 
HUE 143 
HUH 143 
KOU 139 
HTE 138 
HOC 138 



nPH 137 
PEE 137 
ETC 135 
HHti 135 
OBE 134 
KOB 130 
HHO 130 
COB 130 

itpe 129 
HOr 128 



POfl 128 
KOr 123 
ABO 119 

nEp 119 

TBE 119 
SAB 118 
BAH 117 
KOH 117 
HO/t 117 
EPE 116 



POB 116 
CTH 115 
I ITOi 113 
ACT 112 
AHA 111 
HBIE 110 
OJIL 110 

noc 110 
cto 110 
Ero 104 



HEB 104 
HKA 103 
TBO 103 
AEO 101 
HCT 101 
TPA 101 
BET 100 
OBC 100 
PA3 100 



3-h . The 20 trlgraphs appearing 50 or gore tiires as beginnings of vorda 
in 10,601 words in Russian plain text, arranged according to their 
absolute frequencies 



npo 205 


nPH 


95 


noc 


81 


bbhi 


73 


non 


6l 


CTA 


59 


ron 


51 


npE 116 


COB 


87 


IIEP 


78 


PAE 


72 


EOJI 


60 


PA3 


53 


TOP 


50 


3AB 108 


KOJI 


84 


non 


74 


HAP 


71 


PAft 


60 


KOH 


52 







4. The 58 tetragraphs appearing 50 or more tildes in 67,850 letters of 
Russian plain text, arranged according to their absolute frequencies 



Horo 114 1 


COBE 


87 


EJIBH 


78 


BJIEH 


68 


npoM 


60 


0H3B 


54 


TEJIE 


111 


ABOH 


86 


CTBO 


78 


CKOft 


66 


F0M3 


60 


KOTO 


53 


MTEJI 107 


3AB0 


85 


HqEC 


76 


CTAB 


66 


OTOB 


59 


HHbIX 


53 


Koro 


99 


CTBE 


84 


OBET 


74 


APOH 


65 


BETC 


56 


BOHC 


52 


HOCT 


98 


JILHO 


83 


HECK 


74 


EJILC 


65 


TOPO 


56 


ETCK 


52 


mm 


97 


nOJIH 


82 


AHOB 


72 


ECTB 


64 


ATEJI 


55 


OTOP 


51 


EHHO 


95 


CKOr 


82 


OCTA 


70 


CTAH 


64 


toto 


55 


BCKO 


.50 


EHME 


88 


EHHH 


81 


CTPO 


70 


HAPO 


63 


ETCH 


55 


OJIXO 


50 


EPEH 


87 


AEOT 


80 


BEHH 


69 


OBAH 


62 


CTBA 


55 


CKtfft 


50 


PAEO 


87 


JIEHH 


80 


TBEH 


69 


CTPA 


6l 











5- Ay^a^e_lengtn olLjyQrds_ln ^g a_lan _nla lnJart = 6.4 
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